Starry messages


The first scientific observations with telescopes displaced Earth from the centre of the Universe. 
Modern technology continues to humble us but should not distance us from the cosmos itself. 


Four hundred years ago, the Universe changed. Or, at least,
our perception of it did, thanks to Galileo Galilei’s scrutiny
of the night sky with a telescope. Within a couple
of years, his observations of the Moon, phases of Venus and
satellites of Jupiter shattered the old Ptolemaic model of our
Solar System. To the church's dismay, Earth assumed its rightful place
as one of several planets orbiting the Sun (see page 28).

Marking Galileo's anniversary, the International Year of Astronomy
seeks to remind us of the humbling nature of gazing at the heavens.
Through programmes offering cheap telescopes for the masses, teach-
ing materials and global heritage projects, astronomers are inviting
us all to look up and ponder our place in the Universe.

That this prospect is novel speaks to the distance that has come
between people and their environment. Urban light pollution means
that one-fifth of the world’s population can no longer see the Milky
Way with the naked eye. Many city kids, even if they did peer through
the orange smog above their heads, would probably see only a hand-
ful of stars. Astronomers are calling for the dark places on Earth to
be preserved as national parks, so that we do not lose completely our
window on the Universe (see page 27).

Astronomers, in communicating their work, have a natural ally in
beautiful images. The Hubble Space Telescope has a place in the pub-
lic’s heart for this reason — although it has accomplished outstanding
science too (see page 41). But professional astronomers themselves are
not immune to distancing. The technological advances that have given
so much insight into astrophysics have made astronomy one of the
most computer-intensive of the natural sciences. Surveys churn out
terabytes of data to be crunched by armies of postdocs and students
who might never have visited a mountain-top observatory.
Could data mining soon replace serendipitous observational 
discovery? Astronomers will anyway need new visualization 
tools, as demonstrated by a three-dimensional graphic in this 
issue (see page 63), to hunt for the unexpected. 

Since Galileo, advancing telescope technology has made our 
human perspective seem ever more myopic. We now know that 


the Milky Way is but one of many
galaxies scattered throughout the
Universe. Space is littered with fear-
some objects, from giant black holes
to energetic γ-ray bursts. Planets are
plentiful around other stars. Even 
the normal matter that makes up our bodies is only a minority con- 
stituent of the Universe — most matter is exotic and dark. 

The next generation of telescopes (see page 18) will push astron-
omy into a new realm. Within two decades the whole night sky will be
scanned continuously and recorded digitally across many wavebands. 
Trips to observatories will become obsolete, except for detailed follow- 
up work. Astronomical data will be generated on an industrial scale and 
mined from a laptop. Yet the new vistas revealed will still surprise. 

Every now and then, someone predicts the ‘end of science’. It was
believed in the 1890s that there were just a few problems left to solve 
in physics: yet Albert Einstein solved three of them in 1905, and 
founded three new branches of physics. In the next 40 years, never 
mind the next 400, astronomy will change. It is not the end, but the 
beginning of a new phase. The Universe will continue to humble us,
if we take the time to look.


Experts still needed 


There are good reasons to be suspicious of 
metric-based research assessment. 


Many countries are keen to measure their universities’
research performance with minimal burdens on the par-
ticipants. Not least of these is the United Kingdom, which
last month announced the results of its sixth and final Research
Assessment Exercise (RAE, see page 13). 

The RAE relied heavily on expert peer review of research publica- 
tions, and attention in Britain and beyond is now focused on what 
form the replacement system will take. The proposed successor, the 
Research Excellence Framework (REF), is opaque. Little is known 
about how it will work other than a central principle: it will assess 
research quality using metrics, including publication citations. It may 
also take into account the number of postgraduates completing their
studies and the amount of research income won by universities. There
will be a smattering of ‘light-touch’ expert review, although the exact
form that this will take is not yet clear — it might simply be used to 
interpret the metrics results. 

But taken alone, publication citations have repeatedly been shown 
to be a poor measure of research quality. An example from this jour-
nal illustrates the point. Our third most highly cited paper in 2007,
with 272 citations at the time of inspection, was of a pilot study in 
screening for functional elements
of the human genome. The impor-
tance lay primarily in the technique.
In contrast, a paper from the same


year revealing key biological insights into the workings of a proton 
pump, which moves protons across cell membranes, had received 10 
citations. There are plenty more examples of such large disparities 
between papers that may be important for a variety of reasons: tech- 
nological breakthroughs of immediate use to many, more rarefied 
achievements of textbook status, critical insights of relevance to small 
or large communities, ‘slow burners’ whose impact grows gradually
or suddenly after a delay, and so on. 

Such isolated statistics serve to illustrate a point that has been 
more systematically documented in the bibliometrics literature. 
Take, for example, an analysis of the correlation between judgements 
of scientific value using metrics, including citations, and those using 
peer review, in condensed-matter physics (E. J. Rinia et al. Res. Policy 
27, 95-107; 1998). The study found disagreements in judgement 
between the two methods of evaluation in 25% of the 5,000 papers 
examined. In roughly half of these cases, the experts found a paper 
to be of interest when the metrics did not, and in the other half, the 
opposite was the case. The reasons for the differences are not fully 
understood. 

It is also important to note that the use of metrics as an evalua- 
tion method does not have widespread support within the scientific 
community. Some members of the expert panels judging work in this
year’s RAE warn of the dangers to the quality of research assessment
under a metrics-based model. These fears were expressed even by 
experts in subject areas thought to be most appropriate for metrics- 
based assessment, such as biology and chemistry. Metrics are not 
well established for the applications of science, or for disciplines less 
dependent on journal publication. 

Britain is not alone in encountering problems in developing robust 
metric indicators of research quality. The Australian government is 
also dealing with a backlash from some universities and leading 
researchers against its current attempts to do the same thing. 

The signs are that, after several false starts and delays, the final pro- 
posals for the REF, due in autumn 2009, are unlikely to be the radical 
departure from the RAE that the government first envisaged in 2006. 
Expert review is far from a problem-free method of assessment, but 
policy-makers have no option but to recognize its indispensable and
central role.


A public service 


The Christmas bird count is a model to be emulated 
in distributed, volunteer science. 


The 5th of January marks the completion of the 109th Christmas
bird count, a yearly rite in which groups of North American
bird-lovers pick a day around the winter solstice, fan out in
teams to their designated areas, and count every bird that they see.

Held every year since 1900, when the National Audubon Society
proposed it as an alternative to the then-popular Yuletide activity
of competitively shooting birds, the count is the longest-running
volunteer science project in the world. Its data have informed reams
of peer-reviewed work, such as an ongoing effort by Audubon
researchers to predict how birds will adjust their ranges in response
to climate change.

The count has served as a model for any number of volunteer science
efforts. Such projects now flourish — not least because the Internet
makes it so easy for scientists to find, recruit and coordinate the vol-
unteers. Out in the field, examples range from Project BudBurst, in
which participants report on the timing of climate-influenced botani-
cal events such as flowering and leafing, to the Great World Wide Star
Count, in which astronomy buffs check the number of stars visible in
certain bright constellations as a way of monitoring light pollution.

Indoors, meanwhile, network-based projects include Folding@home,
in which millions of users allow their idle home computers to be used
to simulate protein folding, and Galaxy Zoo, in which participants
use their prowess at pattern recognition to classify the millions of
galaxies captured in telescopic images — something that still flum-
moxes computers.

The lesson of this list is that the world is full of enthusiastic people —
and that the opportunities for researchers to tap into this enthusiasm
are limited only by their own imaginations. Volunteer science is a
win-win situation for all concerned. Scientists get to take on projects
that would not be feasible for even the largest research group, while
helping to increase the public’s understanding of, and support for,
science. And the volunteers get to have fun, while experiencing the
satisfaction of defending the environment, fighting disease or expand- 
ing human knowledge. 

So researchers should think creatively about whether the data they 
need, or the crunching or sorting they must do, can be outsourced to 
members of the public. And while they are at it, perhaps they should 
also consider joining one or more citizen science projects themselves. 
Participation in such efforts can reconnect scientists consumed with 
grant-writing and project management with the ‘doing’ of science. 
In the Christmas bird count, the most skilled bird spotters and iden- 
tifiers are inevitably the non-scientists; professional ornithologists 
spend too much time doing paperwork. And, of course, volunteering 
for science feels good, especially when you see a black oystercatcher, 
say, or two merging galaxies — something fun, beautiful and rare.


ANNOUNCEMENT 


Evolutionary gems 


About a year ago, an Editorial in these pages urged scientists and 
their institutions to ‘spread the word’ and highlight reasons why 
scientists can treat evolution by natural selection as, in effect, an 
established fact (see Nature 451, 108; 2008). 

This week we are following our own prescription. Readers 
will find at www.nature.com/evolutiongems a freely accessible 
resource for biologists and others who wish to explain to students, 
friends or loved ones just what is the evidence for evolution by 
natural selection. Entitled ‘15 evolutionary gems’, the document
summarizes 15 lines of evidence from papers published in Nature 
over the past 10 years. The evidence is drawn from the fossil record, 
from studies of natural and artificial habitats, and from research on 
molecular biological processes. 

In a year in which Darwin is being celebrated amid uncertainty
and hostility about his ideas among citizens, being aware of the 
cumulatively incontrovertible evidence for those ideas is all the 
more important. We trust that this document will help. 
Obama's picks underline climate focus


Strong roles for biologists as the president-elect chooses his science and technology team. 


John Holdren, a leading voice on climate change 
at Harvard University, will serve as science 
adviser to US president-elect Barack Obama. 
And Jane Lubchenco, a strong advocate of 
marine conservation at Oregon State University 
in Corvallis, will head the National Oceanic and 
Atmospheric Administration (NOAA). 

Holdren, as is normal for US science advis- 
ers, has a background in physics. At the time of 
his appointment, two eminent biologists were 
named to co-chair the President’s Council of 
Advisors on Science and Technology (PCAST) 
with him. One will be Harold Varmus, the 
former director of the National Institutes of 
Health, who led Obama’s science advisory team 
during the campaign. The other is Eric Lander, 
founding director of the Broad Institute in 
Cambridge, Massachusetts, who helped lead 
the push to sequence the human genome. 

Together, these appointments underscore 
how Obama is choosing experienced aca- 
demics for positions in government once he 
becomes president on 20 January. 

If confirmed by the Senate, Holdren will 
replace John Marburger as head of the Office 
of Science and Technology Policy (OSTP), 
which helps set research agendas and 
budgets across multiple federal agencies. 
Holdren will also hold the title of Assistant 
to the President for Science and Technology; 
this is seen by some as a clear step up from the 
title Marburger enjoys, Science Advisor to the 
President (see Nature 455, 453; 2008). 


Holdren’s appointment quickly won praise 
from other academics. “He’s competent and 
at the forefront of so many fields,” says Ralph 
Cicerone, an atmospheric scientist and presi- 
dent of the National Academy of Sciences in 
Washington DC. “He’s encyclopaedic, he’s quick
and he’s deep.” Representative Vernon Ehlers 
(Republican, Michigan), a physicist, adds: “He's 
an excellent choice, a good scientist, and I think 
he will serve the president and the 
country well.”

Holdren also directs the 
Woods Hole Research Center in 
Falmouth, Massachusetts. Now 
an environmental-policy specialist, in his early 
career he worked as an engineer at Lockheed 
Missiles and Space Company in Sunnyvale, 
California, and as a fusion scientist at the Law- 
rence Livermore National Laboratory, also in 
California. He worked on nuclear weapons 
and non-proliferation issues while chairman 
of the executive committee for the Pugwash 
Conferences on Science and World Affairs — 
a familiarity with this issue that is in keeping 
with past science advisers. “He is comfortably 
in the same sort of mould that we've had for 
many decades,” says John Gibbons, a science 
adviser to President Bill Clinton, “except for the 
unusual extent in the depth of his involvement 
in the process of science in government.” 

Looking to the future: John Holdren has long pushed for changes to US energy policy.

Holdren is no stranger to Washington,
having served on the PCAST to Clinton. In that
capacity, he chaired a number of major reports
meant to guide energy policy, including one that
promoted a move away from coal and towards 
nuclear energy and renewables, says Charles 
Vest, president of the National Academy of 
Engineering. “All of the things that you expect 
in 2008 — but that you might not have seen on 
the agenda about a decade ago,” says Vest.
For another PCAST report, Holdren 
delivered a solo presentation to Clinton on 
non-proliferation strategy in 
the early era of post-Soviet Rus- 
sia, says Gibbons. “It was very 
coherent, expert — it answered 
the question authoritatively,” he
says. “That ultimately led to a US position on
plutonium disposition a couple of weeks later
when the president went to Moscow.” Holdren’s
understanding of how the White House works 
will serve him well, says Neal Lane, another 
former Clinton science adviser. 


Rare skill 
In recent years, Holdren has been a tireless 
advocate for improving US energy policy, 
travelling the world and lecturing on science’s 
role in sustainability. He has said that reading 
The Challenge of Man's Future by Harrison 
Brown — which looked at the science of 
sustainability — in high school set him on a 
path to working on the intersection between 
science, technology and society. He has even 
appeared on David Letterman’s late-night show 
to discuss the science of global warming. 

Daniel Schrag, a climate scientist and 
Harvard colleague, says that Holdren’s wide- 
ranging interests belie his depth of knowledge 
in a number of areas. “John has a remarkable
ability to survey vast areas of scientific infor- 
mation and distil them down to their essence — 
that’s a very rare skill,” says Schrag. “He knows 
what he doesn't know, and he knows who to 
ask.” Schrag also says that Holdren is skilled 
at getting people from different backgrounds 
together; as co-chairman of the National 
Commission on Energy Policy, Holdren led a 
bipartisan group including business and indus- 
try leaders to produce a document in 2004 on 
how to “end the energy stalemate”. Holdren 
understands that fixing climate change comes 
with big costs, says Schrag. His familiarity 
with Lawrence Summers, the former Harvard 
president who will head the National Economic 
Council, might help him to that end. 

Marine star: Jane Lubchenco will run the National Oceanic and Atmospheric Administration.

One unresolved question is how all the
climate and energy specialists in the Obama
administration might work together. Nobel-
prizewinning physicist Steven Chu will head
the Department of Energy; former Environ- 
mental Protection Agency chief Carol Browner 
is in a new White House position overseeing 
climate and energy policy. “It'll all have to be 
coordinated very carefully,” says Lane.

Lubchenco, like Holdren, is a past president 
of the American Association for the Advance- 
ment of Science. A marine ecologist, she has 
spoken out against overfishing and done 
research in the hypoxic, or dead, zones that can 
be caused in some areas by fertilizer run-off. As 
head of NOAA she will have jurisdiction over 
a wide range of issues including the National 
Marine Fisheries Service and the National 
Weather Service. 

Dealing with fisheries “will be one of her 
major challenges”, says John Byrne, who headed
NOAA during the administration of Ronald 
Reagan and is a former president of Oregon 
State University. Industry groups are wary of 
her pro-conservation stances, but Byrne thinks 
that Lubchenco is up to the challenge of run- 
ning the agency. “It needs someone to respond 
to issues such as climate change, pressure on 
coastal zones, ocean pollution and over-fishing 
in a firm way. She will do that,” he says.

Another appointment in Obama's environ- 
mental team is Ken Salazar, a Democratic senator 
from Colorado, to head the Department of the 
Interior. Salazar is known as a middle-of-the- 
roader who has protested against Bush adminis- 
tration plans to expand oil-shale development in 
the American west, but who has also supported 
offshore drilling. The agency oversees the US 
Fish and Wildlife Service, which has been buf- 
feted in recent years over its handling of species 
listings under the Endangered Species Act.
Eric Hand and Alexandra Witze, with 
additional reporting by Ashley Yeager. 


Universities struggle as 
value of endowments falls 


Some large research universities have suffered double-digit declines in
their multibillion-dollar endowments since the autumn credit crunch and
stock market collapse. The endowments have also become increasingly
cash-poor precisely when they might be expected to compensate for
downward pressures on academic budgets. The declines are presenting
universities with tough choices: fire sales of endowment assets, or
budgetary trimming in the form of pay cuts, freezes on hiring and
deferred construction projects.

“It’s a very big problem,” says John Walda, president of the National
Association of College and University Business Officers (NACUBO) in
Washington DC. “A good number [of endowments] are seeing about a 30%
decline since last year.”

Harvard University announced in December that its endowment, the world’s
largest and worth $36.9 billion at mid-year, had dropped 22% from that
amount by the end of November — with worse expected to come. As of
mid-December, the world’s second-largest endowment, that of Yale

In some ways, the big universities have been a victim of their own
success. As endowments experienced double-digit increases nearly every
year through the 1990s and 2000s, universities have become ever more
dependent on them in their overall budgets — even as endowment payouts
remained roughly constant at around 5% per year. Ten years ago,
Harvard’s endowment paid for a third of the operating budget of the
school of arts and sciences; now it covers more than half of the
$1.16-billion budget.

Endowments are suffering now partly because of the very investment
methods they used to beat the broader markets year after year.
Universities with endowments bigger than $1 billion — there were 76 of
them in the United States in 2007, according to NACUBO — kept much of
their money in more volatile ‘alternative’ investments that are
‘illiquid’ or difficult to convert into cash quickly, such as hedge
funds and venture capital. By comparison, institutions with endowments
of less than $25 million had a nearly opposite, more conservative
approach, with much of their money being kept in cash and fixed-income
assets (see chart).

John Nelson, an analyst with Moody’s in New York, says the alternative-investment
portfolios of big universities have fared
better than the Standard and Poor's 500
share index, which dropped 31% from
30 June to mid-December.

Walking into the unknown: Harvard University’s endowment has lost 22% of its value since June, and may lose more.

But alternative investments, in addition
to being illiquid, can also suck up cash
from elsewhere. Hedge-fund managers, for
example, require periodic new investments
known as ‘cash calls’. These represent a
problem at a time when universities need
cash for salaries and construction projects
and credit is hard to come by. “The tough
choice that endowment managers face now
is, ‘when are we going to be forced to sell
some of these assets in the down market?’”
says Nelson. Some managers, such as those
at Harvard, are already trying to sell assets
off for far less than they had been worth,
according to reports in The New York Times
and The Wall Street Journal.

Even knowing what the alternative
assets are worth is a challenge. Yeshiva
University in New York City revealed that
$110 million in investments — 8% of the
university’s endowment — had evaporated
in the investment scheme created by
Bernard Madoff, who was arrested on 11
December and charged with fraud that
may have cheated investors of $50 billion
(see ‘Medical charity folds after investment
losses’, page 16). The current climate, says
Walda, “will lead to more scrutiny of the
underlying value of investments”.

The effects on departmental hallways
have been varied and will probably play out
over several years, as most endowments
base their payouts on a principal amount
smoothed by a three-year rolling average.
Harvard has put a partial hiring freeze in
place. Stanford University in Palo Alto,
California, will trim its budget next year
by 5% by postponing new construction,
controlling salaries and cutting some jobs.
The chancellor of Washington University
in St Louis, Missouri, has
promised to take a symbolic
pay cut of nearly 10% off
his half-a-million-dollar
salary. The Georgia Institute
of Technology in Atlanta
is trimming its landscaping budget; the
University of Hawaii is turning off its
weekend air-conditioning; and the Field
Museum of Natural History in Chicago,
Illinois, is cutting its budget by 15% after the
value of the museum's endowment plunged
by nearly US$100 million (31%) in the
past six months (see Nature
doi:10.1038/news.2008.1333; 2008).


Making do
At the California Institute of Technology
in Pasadena, endowment managers had
started putting more money into cash and
cash-equivalent assets in September, says
spokesman Jon Weiner, and the university
expects to weather the storm without a
hiring freeze.

Few, however, are as optimistic about
their prospects. In a survey released on
18 December by the National
Association of Independent
Colleges and Universities, half
of the 371 college and university
presidents who responded reported
freezing new hiring. In addition,
22% reported freezing salaries,
and more than two-thirds were
planning to raise tuition fees for
the next academic year. Their
manoeuvres stem partly from
falling endowments, but also from
concerns such as greater difficulty
in raising funds and declining
student enrolment numbers.

The endowment phenomenon is
largely American, where there is a
long tradition of private fund-raising
in higher education. In Europe,
the only two universities with
multibillion-dollar endowments
are Oxford and Cambridge in the
United Kingdom. By and large,
big research universities outside
the United States are much more
dependent on public funding,
says Thomas Estermann of the European
University Association in Brussels. “Losing
a higher percentage of public funding will be
much worse for them,” he says. That is also
the case for small universities in the United
States, which are worried about budget cuts
at the state level because their endowments
won’t be bailing them out. The 1,600 private
US institutions have a median endowment of
only $16 million, according
to 2007 data from the US
Department of Education.
Most public universities have
similarly small endowments,
with the exception of a
handful of monster-sized ones, such as those
of the University of Texas and the University
of California systems.

Nearly a year ago, in a very different
climate, universities were facing scrutiny in
the US Congress. Representatives wanted
to know why college tuition fees kept rising
even though some universities, bolstered
by successful endowments, seemed to be
getting richer. They proposed legislation
that would mandate state universities to
spend 5% of the value of their endowment
assets each year — for instance to reduce
tuition fees — as is the case for private
foundations. But that legislation has not
yet got anywhere. Many will be watching
to see if endowments, the cash cows for
universities for so long, can sustain them
through the long economic winter.

Eric Hand

© 2009 Macmillan Publishers Limited. All rights reserved
Good grades, but who gets the cash?


Britain's Research Assessment Exercise finds excellence more widespread than a focus
on elite institutions would suggest.


British universities are bubbling with speculation 
about how the annual £1.5 billion (US$2.2 bil- 
lion) in government funding for research will 
be distributed, following the nation’s most 
extensive audit of research quality. 

Of the 52,400 academic researchers from 159 
universities that entered the Research Assess- 
ment Exercise (RAE), published on 18 Decem- 
ber, 17% were regarded as world-leading 
(rated 4*), and 37% as internationally excellent 
(rated 3*). These results will largely determine 
the way that higher-education funding coun- 
cils allocate the money in the academic years 
spanning 2009-14. But universities will have to 
wait until 4 March 2009 to find out exactly how 
the results translate into cash. 

The British government and the funding 
councils have long maintained a policy concen- 
trating funding on the best research. Depart- 
ments with high ratings are usually awarded 
more money per volume of staff than those with 
lower ranks, with funds divided according to an 
algorithm drawn up by the funding councils of 
England, Scotland and Wales. This has tradi- 
tionally resulted in the same 25 or so institutions 
winning around 80% of the available funding, 
a situation that is unlikely to change this time 
round, according to one former vice-chancellor 
of a research-intensive university.

But the latest RAE results show that highly 
rated research is spread much more widely 
than that core of 25. Forty-nine universities 
had at least some 4* research in their submis- 
sions, and at least half of the submissions from 
118 universities were rated 4* or 3*. This raises
fears that the funding will be spread too thinly 
— or that some top-quality research will not 
be supported. “This could seriously erode Brit- 
ain’s position as a world leader, particularly in 
biomedical sciences, in which the United King- 
dom is second only to the United States,” says 
Steve Smith, principal of the faculty of medi- 
cine at Imperial College London. 

Concentrating the funding at the top will also 
leave little cash for the next class of research, 
designated as internationally recognized (2*), 
forcing funding councils to choose between 
their commitment to reward excellence wher- 
ever it is found, and selectively targeting their 
research money. “I don’t think they can do 
both, and they now have a big problem,” the 
former vice-chancellor says. 

“The policy is to fund the best properly and
then see what is left for the rest,” confirms
a source connected to previous RAEs. He
anticipates that 2* work will receive at least 
some funding — there would be a great back- 
lash otherwise, he says — but “my suspicion is 
that the money will certainly run out before 1* 
[nationally recognized work],” he says. 

Work submitted to the RAE is judged by peer 
review, using experts from many countries to 
ensure robust quality comparisons with the 
rest of the world. However, the RAE does not 
directly compare UK researchers or institutions 
against their overseas competitors. William 
Schowalter, a chemical engineer from Princeton 
University in New Jersey and one of the RAE’s 
international judges, says that he is “convinced” 
the benchmarks were set appropriately. 

“We can be confident that the results are con- 
sistent with other benchmarks indicating that 
the United Kingdom holds second place glo- 
bally to the United States in significant subject 
fields,” says David Eastwood, chief executive
of the Higher Education Funding Council for 
England (HEFCE), which runs the RAE. 

Nature analysed RAE data on the top-per- 
forming universities in four key disciplines (see 
graph). The University of Cambridge ranked 
highest in both physics and biological sciences. 
And in chemistry, 40% of the university’s sub- 
missions achieved a 4* grade — 10% more than 
its nearest rival. 

Of the four subjects, physics had the broadest 


departments made more RAE submissions 
than any other subject, with five universities — 
including Leeds, Manchester and Cambridge 
— submitting more than 100 academics for 
assessment. 

This year’s RAE is the sixth and final exercise 
of its kind to be run in Britain. The government 
decided in 2006 to replace it with the Research 
Excellence Framework, which will rely more 
heavily on metrics such as publication citations 
to judge the quality of research. The move has 
been prompted by the expense and administra- 
tive burden that peer review places on insti- 
tutions: the 2008 assessment cost HEFCE £12 
million to run, more than twice as much as the 
previous RAE in 2001. 

The changes will be closely watched by other 
nations that already use, or are in the process 
of implementing, similar research assessment 
systems, including Hong Kong, Australia and 
New Zealand. 

A spokesman from Hong Kong's University 
Grants Committee, which allocates funding on 
the basis of the region’s own RAE, told Nature 
that it is “conscious of the burden the RAE places 
on institutions”. The committee is already con- 
sulting academic institutions about the assess- 
ment process, and “one of the factors we shall 
take into account is the release of the UK RAE 
[2008 results] and the reaction to it, as well as 
further developments in the metrics model the 

Single fund for African science aid 


Cash pool could break power struggles between competing agencies on the continent. 


Plans to funnel much of the 
science, technology and education 
aid bound for Africa into a single 
fund are taking shape. Such a move 
could boost the flow of cash from 
donors and improve coordination 
of science development activities 
on the continent. 

But critics say that the proposed 
central fund — which could man- 
age billions of dollars in aid — is 
simply being used as a pawn in a 
power struggle between agencies 
coordinating development spend- 
ing on the continent. 

Since an African science and 
technology fund was mooted in an 
action plan adopted by the continent's science 
ministers in 2005, progress to that end has been 
slow. However, a pan-African science-policy 
meeting in Abuja, Nigeria, on 3-5 December 
heard that the African Development Bank is 
investigating whether it can host the fund, with 
a decision expected around March 2009. 

African Union members could get development aid through a central fund. 

Hakim Elwaer heads the science directorate 
of the African Union (AU), the confederation 
representing 53 African states. He told the 
Abuja meeting that the lack of a transparent 
mechanism for African funding is a key reason 
why many millions of dollars pledged for sci- 
ence by donors, such as the G8 (Group of 8) 
leading industrialized nations, have been slow 
to reach African researchers. 

“The moment you have a fund structure, 
with an efficient, transparent and credible 
management system, this gives more credibil- 
ity for donors to put money in — especially 
the big development partners from Europe, 
Japan, America and the World Bank,” he says. 
He expects the majority of the contributions to 
the centralized fund to come from governmen- 
tal and intergovernmental donors, aid agencies 
and African countries them- 
selves, and emphasizes that 
the fund will not replace bilat- 
eral funding arrangements 
with individual countries. 

A centralized funding system could 
strengthen Africa’s role in defining its own 
development priorities, says Myles Wickstead, 
senior adviser with the Africa Unit of the Asso- 
ciation of Commonwealth Universities and 
visiting professor of international relations at 
the Open University in Milton Keynes, UK. 

And Elwaer hopes that the fund could also 
improve coordination of science development 
activities on the continent, which range from 
financial support of individual research 
projects to science-literacy programmes in 
African parliaments. This should counter an 
increasing duplication of efforts as intergov- 
ernmental agencies, eager to capitalize on the 
willingness of donors to spend money on Afri- 
can science, have drawn up science strategies 
independently. 

“Many organizations 
are simply seeking to 
control financial flows.” 


Astonishing duplication 

Researchers from the Network of African 
Science Academies are already working on 
identifying the overlaps between the three 
biggest science strategies: the 2005 action 
plan nurtured by the AU and its implement- 
ing partner, the New Partnership for Africa’s 
Development (NEPAD); a second drawn up 
by the United Nations Economic Commission 
for Africa; and a third produced by the African 
Development Bank. 

“We were absolutely astonished that these 
three major initiatives had been developed 
rather independently,” says Mohammed 
Hassan, executive director of 
the Academy of Sciences for 
the Developing World. 

But Calestous Juma, a 
Kenyan professor of interna- 
tional development at Harvard University, 
says that efforts to coordinate development 
activities are not tackling the root of the prob- 
lem: big agencies are jostling to set the science 
agenda on the continent. 

“Is duplication the reason why little is being 
achieved? I don’t think so,” he says. “Many 
of the organizations using this language are 
simply seeking to control financial 
flows, not necessarily to get any- 
thing done.” 

Elwaer rejects the notion that 
the AU is exerting improper con- 
trol over Africa's science agenda. 
But there are certainly tensions 
between the AU and NEPAD 
that have hampered progress on 
their joint strategy. In November 
2007, a growing dispute between 
the AU science directorate and 
the NEPAD science office went 
public when officials engaged in 
a vocal row during a ministerial 
meeting in Mombasa, Kenya, over 
rival efforts to measure science 
activity on the continent. 

One reason for the tensions is historical. 
NEPAD was created in 2001, with the encour- 
agement of G8 donors, whereas the AU was 
established in 2002 and took several years to 
find its feet. By the time that the AU was put 
in charge of policy, with NEPAD handling 
implementation, NEPAD had already spent 
years doing both. “We've never understood 
if the AU or NEPAD should take the lead,” 
says Jimmy Whitworth, director of interna- 
tional activities at the Wellcome Trust, a UK 
medical-research charity. 

But in a closed meeting at the Abuja confer- 
ence, representatives of the two agencies met 
to try to bury the hatchet. “The meeting in 
Abuja seems to have been marked by a recog- 
nition that the AU and NEPAD have to work 
together,” says Wickstead. 

That should significantly improve the 
prospects of the African Development Bank 
fund getting off the ground, although there 
are many hurdles still to clear. By mid-2009, 
the AU hopes to have a clearer idea of how 
the fund would be managed and what sorts 
of programmes it would support. This would 
then be discussed and signed off by African 
science and education ministers. 

And there is the added pressure of the 
economic downturn, which could cause 
donor contributions to dry up. “The timing is 
awkward,” admits Sibry Tapsoba, the African 
Development Bank manager for science. 
“Because of the financial crisis, everybody could 
be saying that resources are lacking. We are very 
positive, but we are also very cautious.” 
Linda Nordling writes about African science 
policy for SciDev.Net. 

Medical charity folds after 
investment losses 


The Picower Foundation, a large US charity 
that funds medical research, has collapsed 
after losing money in a fraudulent US$50- 
billion (£34-billion) investment scheme 
run by businessman Bernard Madoff. 

The Florida-based foundation was set up 
in 1989 and had distributed a total of $268 
million in grants, including a $50-million 
donation in 2002 to fund the Picower 
Institute for Learning and Memory at the 
Massachusetts Institute of Technology in 
Cambridge. The foundation had assets of 
$1 billion invested in Madoff’s scheme, and 
it is unclear whether any of these funds will 
be recovered. 

Other organizations hit by the fraud 
include Tufts University in Medford, 
Massachusetts, which had put $20 million 
into the scheme, and the American 
Technion Society, a science-education 
charity in New York, which had made a 
$72-million investment. 


Spanish solar companies 
slammed over subsidy fraud 


Many of the solar parks that stretch across 
the Spanish countryside are guilty of fraud, 
Spain’s National Energy Commission 
(CNE) has found. 

In the past two years, Spain’s solar 
industry has grown by 900%. The country 
now has the third-largest solar capacity 
in the world, behind the United States 
and Germany. But the CNE’s investigation 
has found that nearly 4,200 photovoltaic 
installations, found on 13% of Spain’s solar 
parks, were falsely registered as producing 
power by 30 September 2008 in order 
to receive greater subsidies from power 
companies. 

Industry analysts are concerned that the 
fraudulent installations have effectively 
used up the subsidies anticipated for 2009 
— and on 17 December the government 
postponed a round of awards planned for 
January until 1 March. 

For a longer version of this story, see 
http://tinyurl.com/9nxa5h. 

Subsidies may run short for Spain's solar parks. 


First antenna switches on in the Atacama 


The Atacama Large Millimeter/ 
submillimeter Array (ALMA) — a radio 
telescope — has received the first 
of its 12-metre-wide radio antennas 
(pictured). The project is a multinational 
collaboration with a cost well in excess 
of US$1 billion. It will eventually consist 
of 54 identical antennas — along with 
12 seven-metre antennas — at a site 
5,000 metres above sea level in Chile's 
Atacama Desert. 
ALMA will be used to study the 
Universe at millimetre and submillimetre 
wavelengths, providing a new window 
on the cool star-forming regions of the 
Milky Way and on early galaxies. Further 
antennas should be delivered throughout 
2009, and the entire array is expected 
to be completed by the end of 2012 (see 
page 18). 


US firms power up to build 
advanced batteries 


Fourteen US companies have formed a 
coalition to make advanced lithium-ion 
batteries for vehicles. The National Alliance 
for Advanced Transportation Battery Cell 
Manufacture is proposing to build one or 
more large manufacturing plants that would 
be shared by its members. 

Members of the alliance include Altair 
Nanotechnologies of Reno, Nevada, 3M 
of Saint Paul, Minnesota, and Johnson 
Controls-Saft, a joint American–French 
battery-production enterprise. The US 
Department of Energy’s Argonne National 
Laboratory in Illinois will have an advisory 
role. The alliance is seeking government 
support for the majority of the venture, 
estimated to cost between $1 billion and 
$2 billion over five years. 


Health organization lays 
plans for major biobank 


Health-care provider Kaiser Permanente, 
based in Oakland, California, is building a 
biobank that could rival the world’s largest 
in size and scope. The organization has 
received a grant of US$8.6 million from 
the philanthropic Robert Wood Johnson 
Foundation in Princeton, New Jersey, to 
collect DNA specimens from 200,000 people 
enrolled in Kaiser’s medical-care plan. 
Kaiser will link the DNA to participants’ 
existing electronic medical records and 
to a planned database of environmental 
information, with the aim of reaching a total 
of 500,000 participants by 2012 — making 
it much larger than any other current 
biobank in the United States. The data will 
be available to researchers around the world 
for studying how genes and other factors 
influence disease risk and health. 

For a longer version of this story, 
see http://tinyurl.com/9bvr5d. 


NASA awards contracts to 
supply space station 


Orbital Sciences of Dulles, Virginia, and 
Space Exploration Technologies (SpaceX) 
of Hawthorne, California, have secured 
contracts from NASA to ferry cargo to the 
International Space Station (ISS) from 2010. 

The contracts, which could each be 
worth up to US$3.1 billion, will see the 
private carriers supply 40-70% of the space 
station’s US cargo each year until the end 
of 2016. The firms will be stepping into the 
breach left when NASA retires its space 
shuttles in 2010. 

In September 2008, SpaceX launched the 
first privately developed liquid-fuel rocket 
and its cargo into low Earth orbit. The 
company’s ISS deliveries will probably be 
made with Falcon 9, a larger version of the 
rocket, which is expected to make its first 
launch within months. 


Correction 

The News Feature ‘Phoenix: a race against time’ 
(Nature 456, 690-695, 2008) suggested that 
Honeybee Robotics failed to follow revised 
designs that would have corrected a flaw in a part 
of the NASA Phoenix spacecraft. A subsequent 
investigation by mission managers at NASA's Jet 
Propulsion Laboratory found that Honeybee was 
not at fault. See Correspondence, page 26. 

ALMA (ESO/NAOJ/NRAO) 


P. GINTER/SCIENCE FACTION/GETTY IMAGES 


The new boss in town 


Barack Obama's transition team is hitting the ground 
running, and its speed and openness are winning praise, 
as David Goldston reports. 


The weekend after the US presidential 
election, I was in Chicago for a confer- 
ence. At about 4 p.m. on Saturday after- 
noon, I stopped at a crowded coffee shop on one 
of the city’s busiest streets and ran into two of 
president-elect Barack Obama's foreign-policy 
aides, who were working unobtrusively on their 
laptops at one of the small tables. “What are 
you doing here?” I asked, thinking that the first 
Saturday after a two-year campaign might be a 
day off. But they didn’t comment about being 
at work. They said that the campaign office had 
now shut down and the transition office wasn't 
ready yet — so they had to work somewhere. 

The story is emblematic of the Obama 
transition, which seems to have impressed 
just about everyone in Washington DC with 
its speed, diligence, expertise and openness. 
Although there have been reports of tensions 
at some agencies, particularly NASA, one usu- 
ally partisan White House official told me that 
the Obama team gave him “this squishy good- 
government feeling I don't get very often”. And 
someone who worked on the Clinton transi- 
tion noted that in 1992 “we didn’t even meet” 
until several weeks after election day. 

Although some top Obama aides remain in 
Chicago, the bulk of the transition operation 
is housed in a nondescript government build- 
ing in Washington. The setting is appropriate: 
although the level of excitement is high and the 
pace frenetic, the actual work is the day-to-day 
stuff of government — lots of meetings, writing 
papers, sending ideas up the chain of com- 
mand with uncertain results, and racing to meet 
deadlines. “It seems like there's a deadline every 
day,” one of the transition staff lamented to me. 
“They obviously can't all be meaningful.” 

There are two types of transition team: policy 
teams that are organized by topic and designed 
to look at big-picture issues, and review teams 
for each federal agency. The names of all the 
teams and their members are available on the 
transition website (www.change.gov). In an 
unprecedented move, the site also posts all the 
written proposals that interest groups have sub- 
mitted. The transparency is refreshing and has 
been lauded by public-interest groups, although 
it could limit the kinds of ideas the transition 
teams hear. Some groups may be reluctant to 
‘think out loud’ or to offer ideas that diverge 
from their party line because everything 
will become public. But no one seems to be 
complaining. “We may take an extra day before 
sending something in” to make sure it’s fully 
vetted, but the content is basically the same, 
said Tobin Smith of the Association of Ameri- 
can Universities. Unsurprisingly, transparency 
does not extend to internal recommendations 
to Obama. The transition team has not been 
making its own papers public, and its members 
have been told not to speak to the media. 

Although the transition team has been solic- 
iting ideas on every imaginable issue, in early 
December, as economic news became ever 
more grim, the stimulus package that Obama 
hopes to see enacted in January began to domi- 
nate. Putting together multibillion-dollar pro- 
posals at high speed is a little “disconcerting”, 
one of the transition staff told me, noting that 
most of the team members are more used to 
worrying about budget constraints. Smith 
said Tom Kalil, a top transition official on 
science and innovation, called him on Monday 
8 December and said he needed any ideas for a 
stimulus bill by that Thursday. 

As the economy worsened, the criteria for 
what might be included in a bill apparently 
broadened. Smith said that earlier on, the teams 
seemed to be looking only for stimulus projects 
that could spend all their money in a year; by 
December that had stretched to three-year 
projects. “They realized that it can take projects 
a while to ramp up,” said Smith, although they 
still want ideas that are “shovel-ready”. 

The transition team has been running its 
proposals past the career staff at the Office of 
Management and Budget (OMB), an OMB 
staffer told me. He said that the OMB has 
been asked to review technical matters such as 
whether the numbers look realistic and how 
quickly the money could be spent. In policy 
areas in which the OMB and transition staff 
have a good and open relationship, discussions 
of the proposals can be extensive. The transi- 
tion teams have eliminated or altered spending 
ideas because of OMB guidance, he said. 

Before the stimulus temporarily shunted eve- 
rything else aside, the transition teams had been 
meeting with just about every group under the 
sun on a wide range of policies. Francesca Grifo, 
who heads the Union of Concerned Scientists’ 
work on scientific integrity, said she has partici- 
pated in about five meetings with teams cover- 
ing relevant agencies. “In some cases, we reached 
out to them,” she said, “in others, they asked to 
meet with us.” The Environmental Protection 
Agency transition team asked the group to set 
up a meeting of scientific and public-interest 
groups that had worked on scientific-integrity 
issues. Transition sources indicate that Obama 
is likely to issue an executive order on scien- 
tific integrity shortly after taking office. At all 
the meetings, Grifo said, the transition teams 
asked many questions, but were careful not to 
give away anything about their own thinking. 

Smith noted that science groups are playing 
a much more active role in this transition than 
in past ones, partly because Obama’s team has 
been open and partly because of concerns that 
have accumulated during the Bush adminis- 
tration. “We started being active during the 
campaigns,” he said. “And for the first time, we 
recommended names” of people for science 
and education posts. 

The teams have also been meeting with 
each federal agency. The transition team for 
the Office of Science and Technology Policy 
(OSTP) has met with all the political and career 
staff at the OSTP at least once, a White House 
official told me. The meetings have been candid 
and friendly, and have focused on decisions that 
will need to be made in the first 100 days, such as 
questions on the switch from analogue to digital 
television, the shutdown of the space-shuttle 
programme, and the Intergovernmental Panel 
on Climate Change. Classified matters have had 
to wait because the OSTP transition team does 
not yet have the necessary clearances. 

The official said it was very helpful that some 
of the OSTP transition team had served in the 
Clinton administration and understood the 
White House operation. But still, he said, at a 
human level, the process left him with a “funny 
feeling. You see the transition team walking 
around with their badges in a building you 
think of as your place, and you realize it’s not 
your place any more.” 
David Goldston is a project director at the 
Bipartisan Policy Center in Washington DC. 
Reach him at partyofonecolumn@gmail.com. 

NEW EYES, NEW SKIES 


The next 40 years will see telescopes that far outstrip any ever seen before. Jeff Kanipe profiles 
four of them; illustrations by Lynette Cook. 


The armillary and astrolabe are now seldom seen outside 
museums and antique shops; but the telescope, which 
joined them in the observatories of early modern Europe 
400 years ago, is still at the centre of the astronomical 
world. In optical precision, in the wavelengths that are used and 
in their sheer size, they have changed almost beyond recognition 
(see page 28). After two centuries in which they left no records other than 
the users’ sketches, and a century in which their visions were recorded 
on photographic plates, they have in the past decades become entirely 
electronic. And they are now stationed everywhere — oceans, deserts, 
mountain tops and all kinds of orbit. But the job is still the same: collect- 
ing and focusing whatever information the Universe sends our way. 

Yet for all its glorious 400-year history, the astronomical telescope’s 
best days may still be to come. Telescopes currently in development 
show an unprecedented degree of technical ambition as they seek to 
provide near-definitive answers to questions that, a generation or two 
ago, it was hard to even imagine investigating. 

To answer these questions, the telescopes profiled here will often work 
in complementary ways. The infrared capabilities of the James Webb 
Space Telescope and the radio acuity of the Square Kilometre Array will 
both be used to probe the Universe at the time of its own first 
light — the birth of the first stars and galaxies. The radio array will 
map the large-scale structure of the Universe, elucidating the role 
in that structure of ‘dark matter’ and ‘dark energy’, as will studies of 
the faintest galaxies by the Large Synoptic Survey Telescope and 
European Extremely Large Telescope. That behemoth and the 
orbiting Webb will, in turn, complement each other in their attempts to 
characterize planets around other stars with unprecedented detail. 

This quartet, for all its ambition and expense, does not exhaust the pos- 
sibilities being explored and wished for. The Atacama Large Millimeter/ 
Submillimeter Array will soon revolutionize astronomy at its chosen 
wavelengths. Other projects are planned throughout the electromag- 
netic spectrum and beyond into the new realms of gravitational waves 
and neutrinos. These instruments are all being designed with specific 
scientific challenges in mind. But at the same time, all concerned hope 
devoutly to discover something as strange and unlooked for as Galileo’s 
mountains on the Moon — or spots on the face of the Sun. 

Jeff Kanipe is a science writer based in Maryland. Lynette Cook is an 
artist and illustrator based in California. 
See Editorial, page 7, and online at www.nature.com/astro09. 

Like the Hubble Space Telescope, to 
which it is in some ways the succes- 
sor, the James Webb Space Telescope 
(JWST) will be the orbital flagship of 
its generation. But whereas the Hubble sees 
mainly in the visible and ultraviolet, JWST 
is optimized for the infrared. That means it 
can see things hidden from the Hubble and 
its like by dust, and peer into the high-red- 
shift epoch just after the Big Bang at objects 
indiscernible at visible wavelengths — such 
as the first stars. 

Astronomers at the Space Telescope Sci- 
ence Institute in Baltimore, Maryland, started 
their first plans for a follow-on instrument in 
1989 — a year before the Hubble itself was 
launched. It should finally make it to the 
launch pad 24 years later. Although its design 
and cost have changed a few times over the 
past two decades (see Nature 440, 140-143; 
2006), its main mission remains simple and 
visionary — to study uns: ect: 
phase of the his 
so, the telescope v 


weight optical systems made from beryllium, 
extremely sensitive infrared detectors and a 
cryocooler that can maintain the mid-infra- 
red detectors at a frosty 7 kelvin indefinitely. 

The most striking of the new technologies, 
though, affects the very heart of the telescope. 
JWST’s designers wanted a mirror that would 
have been too large to fit into the payload fair- 
ing of any rocket available. So they designed 
one in segments, a mirror that could be 
launched folded up and then deployed to its 
full 6.5-metre diameter once the telescope set- 
tles into its final orbit, 1.5 million kilometres 
from Earth. That distance gives the telescope 
much more sky to look at than the Hubble 
gets, and keeps it cooler, too. But it has its 
downside: as yet there is no way to get there to 
fix any problems so, unlike the Hubble, JWST 
has to work perfectly from the start. 

At the moment, says John Mather, Nobel 
laureate and senior project scientist for JWST, 
the telescope is designed to last for at least five 
years, but longer may be possible. It will carry 
s worth of fuel, and the presence of 
the cryocooler means that, unlike earlier 
infrared missions, its lifetime is not limited 
by a fixed supply of coolant. “If we are lucky 
and clever we hope to conserve fuel and per- 
haps run much longer,” says Mather. “But we 
can't promise that.” What Mather thinks he 
can promise is discovery. “We do not know 
which came first, black holes or galaxies, and 
we do not know how it happens that there is 
a massive black hole at the centre of almost 
every massive galaxy. If there are any surprises 
about the early Universe, I am guessing that 
they will be in these areas.” 

Collection area: 33 square metres 
First light: 2013 
Cost: US$4.5 billion all in 
Unique selling point: The best infrared possible 

JWST is not just about deep space and dis- 
tant epochs, though; it will also scrutinize the 
shrouded origins of objects closer to home
— such as nascent solar systems, coalescing
stars and star clusters amassing within dusty 
nebulae, says Matt Mountain, director of the 
Space Telescope Science Institute. 

But where the telescope will really stand 
out will be its ability to probe the very early
Universe. “JWST is so sensitive,” says Mountain,
“that we can take actual spectra of the
very earliest objects you can just barely
detect with Hubble.”

A DEEP, MOVING IMAGE


Sometimes telescopes see double,
not because of any aberration, but
because that is the way the Universe
works. The bending of light by
intervening masses — called gravitational 
lensing — means that some galaxies are seen 
by Earthly observers in more than one place. 
By adding together survey image after sur- 
vey image, and so measuring things that no 
individual image would show, the design- 
ers of the Large Synoptic Survey Telescope 
(LSST) hope to find a significant fraction of 
the 10,000 or so such images in every square 
degree of sky. They also hope to open up a
neglected dimension in astronomy: time. As
well as adding together images of the same
part of space taken again and again to reveal
new depth, they will compare those images
[...]

[...]ary mirror,
which will be ground so as to provide a field 
of view of 10 square degrees. That’s 49 times 
the size of the full Moon, and more than 
300 times the field of view of the Gemini 
telescopes, which have mirrors of similar 
size optimized for staring in a single spot. 
Over ten years, says Željko Ivezić, of the
University of Washington in Seattle, the 
LSST system will look at everything in its
field of view about 1,000 times. A massive
amount of computing power will be used to
correlate, compare and catalogue the torrent
of data — and to make them all available
on the Internet. Anyone with a computer
[...]ts, and amateur and professional
astronomers — will be able to participate in
the process of scientific discovery.

[...]re will be a lot of overlap. Once fully
operational, the network will be able to keep a
constant eye on new-found objects, such as
supernovae and new asteroids, for days o[...]

Studies of objects that have been gravitationally
lensed should reveal huge amounts
about the structure of the Universe in general,
and the distribution of dark matter and
the effects of dark energy in particular. At
the same time, though, LSST will mount a
virtual space patrol, looking for potentially
hazardous near-Earth asteroids. Astronomers
already know where most of the big,
killing-off-species-wholesale asteroids are.
LSST will be one of the tools that catalogues 
the vast majority of lesser asteroids still 
capable of smashing a city. But with a sen- 
sitivity to faint, transient events 1,000 times 
greater than ever previously achieved, the 
telescope will not restrict itself to the ‘ver- 
min of the skies’ in Earth’s backyard. It will 
observe vast distant cataclysms, such as collisions
between neutron stars, and is all but
sure of discovering whole new categories of 
transient events. 

The project is overseen by the LSST 
Corporation, comprising more than 100 
scientists and two dozen laboratories, uni- 
versities and institutes based mainly in the 
United States. Although the project’s design 
is still being worked out, the main mirror [...]

[...] with the [...] a survey, [...]
object in the Universe, known, unknown
and newly discovered. “For the first time in
history,” says Ivezić, “we will catalogue and
study more celestial objects than there are
people on Earth.”

SUPERSIZING THE HEAVENS

The European Extremely Large Telescope

Space-based observatories have
much to recommend them; they
see in wavelengths forever hidden
from observers on Earth's surface,
and with the crystal clarity of an airless sky.
But they are hard put to compete in one of
the most important aspects of telescope
design: size. Whether you want a telescope
for a backyard or a mountain top, the larger
the light-gathering surface, the more photons
you will catch. With a primary mirror
measuring 42 metres across, the European
Extremely Large Telescope (E-ELT) is
designed to pick out features only hinted
at in the best images made by orbiting
telescopes, detecting objects as faint as the
planets around other stars. Its backers think
that such a telescope could put satellites out
of the optical-and-near-infrared business
until similar sized behemoths can be assembled
in orbit — or on the Moon.

Today E-ELT, a project of the European
Southern Observatory (ESO), is
halfway through a 3.5-year, €57.2-million
($72.5-million) detailed design phase. Construction
could start as soon as mid-2010.

It is not the only such giant planned: the
Thirty Meter Telescope is being built by a
public-private consortium including universities
in North America, and the Giant
Magellan Telescope is a collaboration
between several US and Australian universities
and research institutes (see Nature
452, 142-145; 2008). But E-ELT is the most
audacious in scope and, with a budget of
€950 million, the most expensive.

That said, the current E-ELT design is
a scaled-down version. When originally
discussed as the OverWhelmingly Large
Telescope (OWL), it was to have had a mirror
100 metres in diameter. The current 42-metre con[...]
more pragmatic. But if adaptive optics that constantly
correct the mirror’s shape to offset atmospheric
turbulence work for a system of this size,
as its designers hope, E-ELT should still have a
spatial resolution 18 times better than that of
the Hubble Space Telescope. Today’s largest
mirrors are those of the twin 10-metre Keck
telescopes on Mauna Kea; E-ELT would have
a light-gathering area almost nine times
larger than both of them put together.
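
As a rough check on these comparisons, the sketch below recomputes them from the mirror diameters alone. It assumes idealized, filled circular apertures (real segmented mirrors have gaps and central obstructions) and a diffraction-limited resolution that scales with diameter at a fixed wavelength. The 42-metre and 10-metre figures come from the text; Hubble's 2.4-metre mirror is an outside value, and the function and variable names are illustrative only.

```python
import math


def mirror_area(diameter_m: float) -> float:
    """Area in square metres of an idealized filled circular aperture."""
    return math.pi * (diameter_m / 2) ** 2


E_ELT_DIAMETER_M = 42.0   # from the text
KECK_DIAMETER_M = 10.0    # from the text; there are two Keck telescopes
HUBBLE_DIAMETER_M = 2.4   # assumption: not stated in the text

e_elt_area = mirror_area(E_ELT_DIAMETER_M)
keck_pair_area = 2 * mirror_area(KECK_DIAMETER_M)

# Ratio of collecting areas: E-ELT versus both Kecks combined.
area_ratio = e_elt_area / keck_pair_area

# At a fixed wavelength the diffraction limit scales as 1/diameter,
# so the resolution gain is simply the ratio of the diameters.
resolution_gain = E_ELT_DIAMETER_M / HUBBLE_DIAMETER_M

print(f"E-ELT collecting area: {e_elt_area:.0f} square metres")
print(f"E-ELT area / two Kecks: {area_ratio:.2f}")
print(f"E-ELT / Hubble diameter ratio: {resolution_gain:.1f}")
```

Under those assumptions the area comes out near the 1,385 square metres quoted for E-ELT, the ratio to the two Kecks combined is about 8.8, and the diameter ratio to Hubble is 17.5, in line with the roughly 18-fold resolution gain cited.
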
When it comes to thinking big, E-ELT’s 
designers have the advantage, not shared by 
astronomers in the United States, of steady 
annual funding. ESO’s management hopes to 
build the giant without asking the organization's 
13 member states for more money, or having to 
hunt around for new partners. That could still 
change though: the telescope’s optical design is not 
finalized, nor is its ultimate location. The Canary
Islands, Morocco, Argentina and two sites in Chile
are all under discussion.

All this is just background noise to Roberto
Gilmozzi, E-ELT’s principal investigator, who
already has his eye on the telescope’s potential
discoveries. “One of the key goals of the telescope
will be the quest for Earth-like planets around 
other stars,” he says, “both by indirect methods 
— such as measuring radial velocity variations in 
the parent star — and by direct imaging and spectroscopy
of their atmospheres.” It will also aim to
study proto-planetary systems around young stars 
in unprecedented detail. 

“E-ELT will provide a ‘quantum jump’ in 
sensitivity over the present generation of tele- 
scopes — comparable to that of Galileo's telescope 
over the naked eye,” says Gilmozzi, and that will 
affect astronomy at all scales. Galaxies that are now 
blurs will turn into fields of distinct stars. And on 
the largest of scales the telescopes will carry out 
a “physics experiment” that would not be pos- 
sible with any other telescope, Gilmozzi says. By 
comparing ultra-precise velocity measurements 
of very distant objects taken several years apart,
E-ELT will make the first direct observations of
how fast the Universe is expanding.


Collection area: 1,385 square metres
First light: 2010s
Cost: $1.4 billion to first light

Other ways to other planets

The international COROT mission,
launched in 2006, has spent two years
trying to detect dips in the brightness of
stars caused by intervening planets. Later
this year it will be joined by NASA's Kepler
mission, and a follow-on European mission, [...]

[...] smaller planets, particularly in or near the
‘habitable zones’ around stars where liquid
water is possible on the surface. E-ELT
might be used to confirm and follow up
the findings of such missions — perhaps
making spectroscopic measurements of [...]

[...] from space with such satellites and study
from Earth with a new generation of very
large telescopes such as E-ELT means
that astronomers will be able to start
putting together profiles of other planetary
systems, some of which may be similar [...]

Obama must match
science rhetoric
with action


SIR — I welcome the news of
Barack Obama's smart choices for 
cabinet and sub-cabinet positions. 
This is a promising sign that the 
scientific process will once again 
be broadly valued here. 

A group of graduate students 
recently asked my perspective on 
the impact of the current funding 
crunch. As a young investigator 
who began a tenure-track position 
in 2004, one year after the US 
National Institutes of Health 
(NIH) funding trajectory inflected 
from flat to negative, I understand
the challenges to new scientists. 

Since 2004, I have been the
main author of three published
articles. During this time, I have
been awarded several small
grants and have narrowly failed to
attain an R01 application — the
benchmark for promotion and
tenure at most US universities.
Meanwhile, I have watched junior
faculty with similar records moved 
out of the tenure stream, seen 
scientific enthusiasm eroded and 
heard faculty members question 
the rewards of a career committed 
to research. What makes the 
crucial difference for me is 
tremendous support from my 
department and colleagues. 

The former NIH director, Elias 
Zerhouni, strove to reverse the 
trend that has raised the average 
age of first funding from 37 in 
1980 to 42 in 2007. As a result 
of his efforts, at least 1,650 
investigators will receive their 
first R01 in 2009, up from 1,354
in 2006 (see Nature 456, 153;
2008). I hope to be among them.
My age? 39. My optimism? High, 
reflecting a supportive university 
environment. 

Continued, active investment in 
new faculty not only ensures the 
success of young scientists but 
also directly influences the career 
decisions of PhD candidates. 
I hope our new president and
administration truly recognize
that our future is in science
and technology, and that a
new generation of innovators
needs their full support.

Lisa Borghesi Department of 
Immunology, University of 
Pittsburgh School of Medicine, 
Pittsburgh, Pennsylvania 15261, USA 
e-mail: borghesi@pitt.edu 


Problems with 
anti-plagiarism 
database 


SIR — Sophisticated tools have 
been developed to detect duplicate 
publication and plagiarism, as 
noted in M. Errami and H. Garner's
Commentary ‘A tale of two
citations’ (Nature 451, 397-399;
2008) and in your News story
‘Entire-paper plagiarism caught by
software’ (Nature 455, 715;
2008). To my surprise, one of
these tools, Déjà vu (http://spore.swmed.edu/dejavu),
classifies four of our publications as
unverified duplicates. These report
the analysis of Bruton’s tyrosine
kinase mutations associated with
the rare disease X-linked
agammaglobulinaemia (XLA) and
of the database BTKbase.

Each of these is a genuinely 
different and independent report; 
they cover the development of the 
database and different analyses of 
the growing data set. The reason 
why they are branded as suspect 
cases is probably that the journal 
Nucleic Acids Research, in which 
three of them were published, 
has a special format for articles in 
their annual database issue. 

Between 1995 and 2006, 
we published eight articles on 
BTKbase. The number of XLA 
cases recorded in the database 
has grown from 118 to 1,111 during 
this period. Several colleagues 
who maintain databases are also 
listed in Déjà vu. It is worrying that
such legitimate articles written by 
research infrastructure developers 
and providers are labelled as 
unethical, just because of some 
overlap with previous papers 
as a result of a journal's strict
formatting requirement. 

Detection of fraud, including
duplications, is obviously crucial
to the integrity of science. But
it is unethical to list thousands
of scientists in a public Internet
service as suspects, without 
verifying the claims that are being 
made. Although the developers 
indicate that the data are 
provisional, there is still a risk 
that the listing will affect decisions 
on careers, promotions or 
research funding if individual 
cases are not investigated. 

No professional scientist wants 
even the slightest suspicion of 
fraud to tarnish their scholarly 
reputation, so listed cases need to 
be closely investigated. To detect 
real duplicates, the full-length 
articles must be analysed, not just 
the abstracts — which occurred in 
the case of our publications. 
Mauno Vihinen Institute of Medical 
Technology, FI-33014 University of 
Tampere, Finland 
e-mail: mauno.vihinen@uta.fi 


Honeybee and the 
Phoenix analysing 
instrument 


SIR — In the News Feature 
‘Phoenix: a race against time’ 
(Nature 456, 690-695; 2008) 
you report on a problem that
stopped the doors to the ovens on 
the Phoenix spacecraft's Thermal 
and Evolved Gas Analyzer (TEGA) 
instrument from opening fully. You 
note that the University of Arizona 
team responsible for TEGA 
noticed the door interference 
problem during engineering
tests and sent revised designs to
Honeybee Robotics of New York, 
but that Honeybee Robotics sent 
back new parts using the “original 
flawed designs”. 

We at the University of Arizona 
are concerned that Honeybee's 
responsibility for the TEGA door 
opening has been overstated. As 
the dust-cover manufacturer, the 
company played a very minor role 
in the overall TEGA construction. 
Most importantly, it was not 
responsible for establishing 
that the TEGA doors functioned 
properly after the dust-cover
assembly was installed.
Unfortunately, because the door-opening
mechanism is a single-use
actuator, which is buried deep
within the instrument, we on the
instrument team could not test it
on the flight unit without a time-consuming
process of dismantling
and rebuilding the system.

© 2009 Macmillan Publishers Limited. All rights reserved

In light of the publicity given to 
the door problem, we would like 
to set a few of the facts straight. 
When the door-opening problem 
was first observed during testing 
of an engineering prototype, we 
documented the needed change, 
along with some other changes, 
in an engineering drawing that 
was a modification to an earlier 
Honeybee drawing. Our drawing 
had more than a dozen changes 
on it, all but one of which were
well documented with new 
dimensions called out. However, 
the change that related to the 
issue with the doors, although 
drawn properly, was not explicitly 
called out as a change in this way. 

We should have asked to review 
the final drawing before metal 
was cut, but we did not. As we 
could not test the operation of the 
doors in the flight unit, we should 
have checked the parts when they 
arrived, but we did not. I recognize
that my group at the University 
of Arizona operates a bit more 
informally than is current practice 
in the aerospace industry; this 
has served us well in the past in 
keeping costs down, but entails 
risk. We should have caught the 
problem and we didn't. In the end, 
the buck stops with us. 

There is nothing associated 
with this event that changes our 
full confidence in Honeybee's 
competence. We have had an 
excellent working relationship 
with the company on this and 
several earlier projects, and 
Honeybee would clearly be 
our vendor of choice for similar 
projects in the future. 

William V. Boynton TEGA instrument 
lead, Department of Planetary 
Sciences, The University of Arizona, 
Tucson, Arizona 85721, USA
e-mail: wboynton@lpl.arizona.edu

COMMENTARY


Time to turn off the lights 


Cities needlessly shine billions of dollars directly into the sky each year and, as a result, a fifth of the world’s 
population cannot see the Milky Way. Malcolm Smith explains why a dark sky has much to offer everyone. 


Over the past six decades, professional
and amateur astronomers
have been pioneering efforts to curb
light pollution to protect the viability of
their observatories. During the 2009 International
Year of Astronomy, particularly
through the Dark Skies Awareness project,
astronomers can find allies in a common
cause to convince authorities and the pub- 
lic that a dark sky is a valuable resource for 
everyone. Reducing the number of lights on 
at night could help conserve energy, protect 
wildlife and benefit human health. 

The most persuasive arguments for 
lighting control are economic ones. Esti- 
mates by the International Dark-Sky 
Association, based on work from satellite
images, show that cities needlessly
shine billions of dollars directly into the 
sky each year. As education on these issues 
improves, some cities are now realizing the 
benefits of controlling such energy waste 
through better-quality lighting, thereby 
reducing dangerous glare and confusing 
lighting clutter. Recent cost-saving measures
in New York include lighting codes
that require dimmers and motion-sensor-enabled
light switches in many buildings.

Village lights reflect off Saharan dust beyond the Teide
Observatory on Tenerife, obscuring stars near the horizon.

Although humans are generally comfortable
in artificially lit environments, man-made light
sources can sow confusion for other animals,
often with deadly results. For example, initiatives
run by the National Audubon Society in
New York, or Toronto’s Fatal Light Awareness
Program, focus on the effects that lights can
have on bird migrations. In more and more cities
in Canada and the United States, switching
off at least some of the light in nearly
empty skyscrapers reduces the unnecessary
annual slaughter of millions of
migrating birds in North America.
Some authorities now recognize that
lights that are switched off during the
migration season are probably not necessary.
Yet, some cities still switch them back on when
migration is over. There are many other examples
and opportunities for cooperation.

Humans are affected too, especially as
urban areas expand worldwide. More than
two-thirds of the US population and one-fifth
of the world’s population have lost naked-eye
visibility of the Milky Way. This has a subtle
cultural impact. Without a direct view of
the stars, mankind is cut off from most of the
Universe, deprived of any direct sense of its
huge scale and our tiny place within it. Satellite
and nightscape photos have illustrated
beautifully just how much humans stand to
lose. The work of The World at Night photographers
has been incorporated into efforts
for the International Year of Astronomy. The
International Astronomical Union has recently
signed a memorandum of understanding with
the United Nations Educational, Scientific and
Cultural Organization (UNESCO) to
pursue the goal of identifying and protecting
astronomical sites of historic
significance. And the UNESCO-supported
Starlight Initiative has a major
international programme investigating
more deeply the issues discussed here. The
potential for profitable ecotourism, associated
with protected, natural, starlit skies is now
being understood in many countries. The US
National Park Service has launched a highly
successful night-time-visits service in a joint
effort between astronomers and park staff.

Even our physiology is deeply entwined with
dark-light cycles. Recent research shows that
light at night triggers signals that cause a reduction
in the normal production of melatonin, a
suppressant of cell division in cancerous
tissues. It is imprudent to draw any grand
conclusions yet, but we should continue to
work with medical professionals to see if
there are opportunities for cooperation.

Our relationship with artificial light
is complicated and changing. Humans
innately fear the darkness, and modern
society relies on light as a security measure
even though there is no evidence that controlling
light wastage increases crime levels.
Moreover, blinding automobile headlights
require higher levels of street lighting to
recover visibility of what’s behind them.
In many places in the world, one can drive
around moderately lit, smaller towns with
side lights. In the countryside, moderate
headlights can be supported with catseye
road reflectors.

As light fixtures with energy-efficient,
light-emitting diodes start to become
more widespread, the International Dark-Sky
Association has been harnessing the
support of professional lighting engineers
and other non-astronomy interests to
encourage responsible, innovative lighting
and guidance design. Although it will
be challenging, we can work together to reduce
the form of pollution and energy waste that is,
arguably, the most easily mitigated.
Malcolm Smith is an astronomer at the Cerro 
Tololo Inter-American Observatory, Casilla 603, 
La Serena, Chile. 


e-mail: msmith@ctio.noao.edu

Mankind’s place in the Universe


Technological developments in astronomy have long helped to answer some of the greatest questions 
tackled by humanity, recounts Owen Gingerich. 


Just over 400 years ago, news 
swept across Europe of an 
astonishing device — two lenses 
within a tube — that almost 
magically brought distant views 
closer. Its exact origins are lost 
in a mist of controversy, but it is known that 
one Hans Lipperhey in Middelburg, the Neth- 
erlands, applied for a patent in October of 
1608. His application was turned down on the 
grounds that the unnamed novelty was already 
common knowledge. 

Seventeen months later a small 
booklet by an Italian mathemat- 
ics professor announced wonderful 
discoveries made with this newly 
devised spyglass. The title page may 
have implied that the ‘perspicillum’ 
was Galileo Galilei’s own invention, 
but he should be forgiven any exag- 
geration, for what Galileo had done 
was to turn a popular carnival toy into 
a scientific instrument. In December 
1609 he used his spyglass to discern 
craters, mountains and plains on 
the Moon. The 400th anniversary of 
these astonishing observations sets 
the stage for what the United Nations 
has designated the International Year 
of Astronomy. 

By January 1610, Galileo had fur- 
ther found four bright moons of the 
planet Jupiter (it would be his con- 
temporary, Johannes Kepler, who 
named them ‘satellites’). Before that 
year was out Galileo had also found 
the phases of Venus, proving that it
revolved about the Sun.

The implications of such findings were
immense. Although Nicolaus Copernicus’s
De revolutionibus orbium coelestium (‘On the
Revolutions of the Heavenly Spheres’; 1543)
— the work that proposed a Sun-centred or
‘heliocentric’ system — was already nearly seven
decades old, it was taken only as a manual for
computing positions of the planets; by no means
was it thought to be a physically real description
of the cosmos. How ridiculous to think of living
on a globe that spun around once every 24
hours! And how could Earth keep the Moon in
tow if it whirled around the Sun every year?

It was Galileo’s work that made it intellectually
respectable to believe in heliocentrism as a
physical reality. Galileo saw that the Moon was
Earth-like, rather than a pure ball of crystalline
aether: the perceived distinction between
Earth, home of corruption and decay, and the
eternal, aethereal heavens was crumbling. Jupiter
had no trouble keeping its moons in tow,
and everyone agreed that Jupiter was moving.
Why could a moving Earth not keep its Moon
on an invisible tether? Galileo's skillful polemics
on this matter became full-blown in his
1632 Dialogue on the Two Chief World Systems
— a work that failed to conclusively prove the
motion of Earth, yet was persuasive enough to
land him in deep trouble with the Inquisition.

Galileo's observations of the Moon had far-reaching consequences.


Heavenly vision

Aristotle had, some two millennia earlier,
described a mundane world of earth, air, fire
and water that was separate from the eternal
purity of the aethereal spheres in the heavens.
The ecclesiastical vision of a heavy, central
Earth upon which the human drama played fit
nicely into this Aristotelian cosmos — volcanoes
belching out smoke and flames hinted at
the hellfire below, whereas the realm of God and
the elect lay beyond the starry firmament. Even
Thomas Digges, the English astronomer who
supported the “noble Copernican philosophy”
in his A Perfit Description (1576), placed a “habitacle
[home] for the elect” among the stars.

It is not surprising that clerical authorities for
the most part resisted the new cosmology. More
than this, the Roman Catholic authorities were
furious that Galileo, a mere amateur theologian,
had the audacity to tell them how to interpret
scripture. It was clearly a turf battle. Galileo
was brought to trial in Rome, allowed to plead
innocent to charges of vehement suspicion of
heresy, but convicted of disobeying orders (in
teaching the Copernican cosmology),
and sentenced to house arrest for the
remainder of his life.

“New philosophy calls all in doubt,”
lamented the poet John Donne. “’Tis
all in pieces, all coherence gone.”

Galileo’s persuasiveness was helped
along by Kepler, whose landmark volume
of theoretical astronomy was also
published in 1609. His aptly named
Astronomia nova or ‘New Astronomy’
relied heavily on looking at the physical
causes of planetary motion — a
critical departure from the past, when
astronomers used strictly geometrical
modelling to explicate the heavens.
Even Kepler’s teacher and mentor,
Michael Maestlin, urged him to forget
about physics and stick to astronomy
(that is, geometry). But Kepler
believed in a physically real cosmos,
and even ahead of Galileo advocated
the Sun-centred system.

Heliocentrism as it had come down
from Copernicus, however, was
flawed. Copernicus used entirely different
geometrical models for the latitudes
of planets than for the longitudes, whereas
for a physically real system, the same model 
should work for both coordinates. According 
to Copernicus, the planets moved in circles (the 
main ones eccentrically placed with respect to 
the Sun), and Earth did not move faster when 
closest to the Sun. It was Kepler’s requirement 
for plausible physical explanations that drove 
him ultimately to postulate an ellipse as the 
basic form of planetary orbits, ironing out these 
difficulties. 

Alongside Galileo’s telescopic observations, 
Kepler’s improvements in planetary theory 
eventually brought about general acceptance 
of the Copernican cosmology. In the introduction
to Astronomia nova, Kepler wrote:
“Perhaps there is someone whose faith
is too weak to believe Copernicus
without offending his piety. Let him
stay at home and mind his own business.
Let him assure himself that he is
serving God no less than the astronomer
to whom God has granted the
privilege of seeing more clearly with
the eyes of the mind.”

Before and since, astronomi- 
cal devices have been at the heart 
of mankind’s literal search for our 
place in the Universe. It is not an 
inexpensive endeavour. Tycho 
Brahe boasted that his observations 
(which did not involve a telescope 
but rather carefully graduated 
naked-eye sightings using quad- 
rants and other devices) cost the 
King of Denmark more than a 
ton of gold — a huge fortune in 
the sixteenth century. Today, tel- 
escopes can cost billions of dol- 
lars. It could be argued that such 
devices are ultimately as useful 
(or useless) to human progress 
as poetry. But they are funded 
because, like the greatest poetry or the build- 
ing of a Gothic cathedral, cosmology is a voyage 
of the human spirit, and as such, a point of per- 

sonal or national pride. It is worth a great price 
to understand our place in the intricate web of 
time and space. 


The farthest reaches 
After Galileo's work and Kepler's refinements 
— replacing a convex objective lens and con- 
cave eyepiece with two convex lenses, allowing 
for a wider field of view and accommodating 
crosshairs — the next major leap forward for 
observational astronomy would not come 
for nearly two centuries. Towards the end of 
the eighteenth century, William Herschel, 
an inspired musician-turned-amateur-astronomer, began
his seemingly quixotic quest to
determine the three-dimensional
structure of the Universe. In the
process, he discovered a new
planet — Uranus — an event that
changed his status from amateur
to professional. The most inventive astronomer
of his age, Herschel set on a path to ever-greater
light-gathering power by building huge
reflecting telescopes. He ended up counting
thousands of stars and discovering hundreds
of clusters and nebulae — the gateway to the
vast outer Universe beyond the traditional
sphere of naked-eye stars.

Fresh views: Thomas Digges's heliocentric Universe
(above) and Galileo's perspicillum.

A century later, thanks to engineering
advances, the discovery of how to silver glass
mirrors, and the ability of photography to
capture images of stars and nebulae, astronomers
finally marched through that portal. In 1859,
in a chemical laboratory in Heidelberg, chemist
Robert Bunsen and physicist Gustav Kirchhoff
discovered the key to unlocking the chemical
secrets of the Universe via the distinctive
spectral lines associated with individual elements.
Meanwhile, rich philanthropists
poured cash into instruments of
discovery, and from the new giant
telescopes emerged a sequence of
consciousness-raising discoveries.
Among them was the immense
size of the Milky Way, far beyond
anything dreamed of by Herschel. Then came
the realization that the Milky Way was but one
of myriad galaxies stretching beyond by millions
(and even billions) of light years, rushing
away from each other as if from a cataclysmic
explosion billions of years ago.

“Will we soon answer the next psyche-shifting
question: are we alone?”

This vast increase in the size and age of the
perceived cosmos set the stage for an angst
as deep as that caused by the displacement
of the ancient human-centred cosmology:
what significance do rational,
observing mortals have in the 
wilderness of a near-infinite 
space-time continuum? This 
existential question has been 
simmering for decades and 
undoubtedly drives our willingness
to invest taxpayers’ money
in further cosmic explorations. 
Today, the electronic revolution
has provided charge-coupled devices,
which have nearly 50 times the
efficiency of the photographic plates
they replace; has made feasible
the collection and handling of data
in the trillions of bits; and has
enabled high-speed adaptive optics to
compensate for atmospheric variations,
allowing astronomers to take
full advantage of large-aperture
Earth-based telescopes.
The stellar and galactic 
denizens of the Universe 
can now be catalogued 
as never before. That 
has included, in recent 
years, the discovery and 
cataloguing of exoplan- 
ets — those beyond our 
Solar System. 

NASA's Kepler mission,
scheduled for
launch in March 2009,
is the first craft designed
explicitly to look for
Earth-like exoplanets.
Don't expect radio
communications from
intelligent life to come from one of those planets.
But other exciting signals could surely be found. 
Today, spectroscopy can be used to spy on the 
atmospheres of far-away planets, and look for 
chemical traces that may hint at signs of life — 
even an extremely basic form of life could pro- 
duce an oxygen signal in a planet’s atmosphere. 
Such studies may help in answering the next 
psyche-shifting question: are we alone? 

The International Year of Astronomy might 
well launch the next intellectual revolution in 
our understanding of our place in the Universe. 
Could this have as much of an impact on soci- 
ety as Galileo and Kepler’s entrenchment of the 
heliocentric view? Only time will tell.
Owen Gingerich is professor emeritus of 
astronomy and history of science at the Harvard- 
Smithsonian Center for Astrophysics in Cambridge, 
Massachusetts, and author of God's Universe. 
e-mail: ginger@cfa.harvard.edu 


See http://tinyurl.com/57589t for further reading. 
See also www.nature.com/astro09. 
Visions of ourselves


The view of our planet from space is beautiful and humbling, yet this shift in human perspective has not 
altered how we care for our environment, argues Charles Cockell. 


Earthrise: How Man First Saw the Earth 
by Robert Poole 
Yale University Press: 2008. 236 pp. $26 


It is human nature to try to 
understand our place in the 
cosmos. Changes in our percep- 
tion of human significance have 
accompanied many revolutions 
in scientific thought, such as 
the theory of evolution and establishing the 
order of the planets in the Solar System. So 
it is perhaps unsurprising that as soon as we 
developed the ability to launch projectiles into 
space, one of the first things we did was to use 
them to look back at ourselves. 

The history and significance of the view 
of Earth from space is the subject of Robert 
Poole’s Earthrise. Although its publication is
titled and timed to mark the 40th anniversary 
of the first snapshots of Earth rising over the 
Moon’s horizon on 24 December 1968, the 
book also traces the long journey that led 
to Apollo 17’s iconic image, ‘Blue Marble’,
captured on 7 December 1972. 

Of the many attempts before the space age 
to understand how Earth looks from afar, the 
Roman historian Seneca provides the most 
remarkable quote: “Is this that pinpoint which 
is divided by fire and sword among so many 
nations? How ridiculous are the boundaries 
set by mortals.” The first human view of the 
curvature of Earth is attributed to the high- 
altitude balloon flight of Explorer II in 1935. 
Floating 22 kilometres above the ground, two 
intrepid aeronauts inside the balloon’s pressu- 
rized gondola snapped a photograph that later 
appeared in National Geographic, with a line 
drawn across the horizon to illustrate that the 
curvature had been seen. 

A robot took the first picture of the whole 
Earth in one frame. Black and white photo- 
graphs of the planet, developed on board the 
spacecraft and scanned by a light that picked 
up different shades of grey, were beamed back 
from Lunar Orbiter 1 on 23 August 1966. 

NASA photographer Richard Underwood 
emerges in the book as an early crusader for 
photographs of Earth from space taken by 
humans. He battled against bureaucrats who 
either thought it was a waste of astronaut time 
or could not see the scientific value. But during 
Apollo 8’s mission, on Christmas Eve 1968, Earthrise
over the Moon was captured for the first
time. Even though the astronauts were trained
for this eventuality, their transcripts reveal that
they found the beauty of the sight profound
and unexpected. The view of Earth from space
has deeply affected many astronauts, particularly
the lunar-module pilots who, while their
fellow countrymen were wandering about on
the Moon, had the time not merely to stare at
Earth, but also to think about it.

Forty years ago, the crew of Apollo 8
captured this iconic image of Earth
rising over the Moon.

In my view, however, the sociological signifi- 
cance of the Earthrise photograph is exaggerated 
— a possibility that Poole might have explored 
more. He devotes just a page to political theorist
Hannah Arendt’s thoughts and author Kurt
Vonnegut’s contrarian observation about the 
Earth images: “It looks so clean. You can't see 
all the hungry, angry earthlings down there — 
and the smoke and the sewage and trash and 
sophisticated weaponry.” And yet, the idea that
the environmental movement was caused by 
this view of Earth is controversial. One might 
equally say that our growing understanding of 
the connectedness of the biosphere was caused 
by the awakening in the 1960s to the global con- 
sequences of the use of atomic weapons and the 
effects of DDT insecticide. Yet the Apollo images 
were co-opted by the environmental movement 
and vocalized as another space-related analogy, 
‘Spaceship Earth’, a concept that highlighted the
planet's limited resources. 
The promise of the future effect of this
holistic view of Earth and its inhabitants may 
also be utopian. The revelations of Charles 
Darwin and Nicolaus Copernicus were not 
accompanied by a reduction in the number of 
wars as a result of a new humility. One might 
even make the case that understanding more 
fully the beautiful, but nevertheless insig- 
nificant scale of our planet might eventually 
dull us to acts of large-scale destruction as 
our field of play expands beyond the borders 
and histories of nations to territorial views of 
whole worlds. 

Whatever one’s opinion, seeing the view 
of our planet from space has been a psycho- 
logical transition point. Like the first time one 
hears one’s own recorded voice, or sees oneself 
on a video, we have been confronted with an 
objective view of ourselves that does not quite 
equate with our previous self-perception. 
Earthrise is a fascinating contribution to a 
growing discussion about how space exploration
and settlement will change civilization.
Charles Cockell is professor of geomicrobiology 
at the Open University, Milton Keynes MK7 6AA, 
UK, and author of Space on Earth: Saving Our 
World by Seeking Others. 
e-mail: c.s.cockell@open.ac.uk 


See Editorial, page 7, and www.nature.com/astro09. 
Q&A: One giant leap for art


Astronaut Alan Bean stepped down onto the lunar surface during the 1969 Apollo 12 mission, but left NASA in 1981 to 
devote himself to painting. With exhibitions of his work taking place this year to celebrate the 40th anniversary of the 
first Moon landing, he tells Nature how he attempts to convey his lunar experience. 


How did you become a painter? 
Accidentally, I ended up becoming the first 
artist in history to go anywhere else beyond 
Earth and paint it. When I got ready to leave 
NASA, half of the astronauts wondered if this 
was a worthwhile thing to do with my life. 
And the others weren't sure. Now they all like 
it. They can see that I’m preserving one of the 
great adventures of humankind in a way that 
is meaningful for them. 


Why are your Moon paintings so 
colourful? 

The Moon isn’t colourful: it’s grey. The sky is black
and we run around in white 
suits. My first paintings were 
mostly grey, because that’s 
what the Moon is. Now I 
realize the artist’s job is not to 
reproduce reality. 


Why do you use Moon dust 
and tools from your mission? 
The Moon was pretty rugged, 
so I wanted to put some texture 
in. The first texture I added was with normal 
art tools. Then I said “Why am I doing this? 
I've got Moon tools” — I’ve got the hammer I 
had on the Moon; I’ve got Moon boots. 

A friend in charge of a museum requested 
the Apollo 12 command module, and they 
shipped it to him in this huge box. He 
looked in the box and a lot of the charred 
heat shield had come off, so he vacuumed it 
up and sent it to me. I put that in there. Then
one day I was looking at the patches from my suit
and saying “Boy, those things are dirty”. And it
dawned on me that they’re dirty with Moon dust
from the Ocean of Storms. I started cutting
them up and put them in too.

Alan Bean's Rock ‘n’ Roll on the Ocean of Storms shows the trials of collecting Moon rock.


What are you working on now? 

My current painting tells the story of when
I fell down on the Moon and Pete Conrad
(Apollo 12’s commander) came over and lifted
me up with one finger. I'm lying on my butt
leaning back and I’ve got my left arm out.
Pete's got his right forefinger around my left
forefinger and he's lifting me up. I’m thinking
of titling it He Ain't Heavy, He’s My Brother.


What is your view of the Moon landings 
40 years on? 

We went to the Moon, truthfully, in peace
for all mankind. We were racing the Russians
but we didn’t try to do anything more than
prove that we could do it. It raised the human
consciousness on Earth because people said
“Look what we can do”. We've got to keep
doing these things.
Interview by Daniel Cressey, a reporter for Nature. 


Voyaging to discovery, alone 


The Age of Wonder: 

How the Romantic Generation Discovered 
the Beauty and Terror of Science 

by Richard Holmes 
HarperPress/Pantheon: 2008/2009. 
380 pp/576 pp. £25/$40 


Everyone knows how to be a great scientist. 
First, you have to be really smart. Having awed 
your school science teacher is good. Humbling 
all the others at university is even better. Then 
you need to find a place where you can let your 
solitary genius come out. You might escape to 


a coffee shop to think; or if your parents are 
really rich, to a trendy loft apartment or, better 
still, a remote and windswept cottage. There 
you will engage in the next and most crucial 
of stages: the creative torture. 

There is an art to this. It is important your 
torment doesn't end too quickly — otherwise 
you'd show you had been working on a problem
that was too simple. Yet if the torment
never ends, you'll have nothing to tell anyone 
about and will remain unknown. After some 
time, ideally a few months, you get to have a 
‘eureka’ moment. At that point all you have
to do is write up your discovery, accept the
plaudits, be it from department colleagues or 
the Swedish Academy, and then if you can bear 
it, repeat from the start. 

We smile at this recap but the basic vision — 
science as an endeavour of individual creative 
genius that explodes in an instant of discovery 
— is one we take for granted. Yet, as Richard 
Holmes describes in his new book The Age of 
Wonder, it was not always so. Most founders 
of modern science in the 1600s, such as Isaac 
Newton, rarely saw their work this way. For 
them the process was clinical, building on a 
slow accumulation of insight. 

The big shift took place in the decades 
around 1800, a period called the Romantic era. 
Star poetry


The Very Small Baseline Group Convenes 
at The Cat and Fiddle 
by Neil Rollinson 


A groaning table of empties makes up 

our Very Small Array — a barley-scented 
interferometer. Here we can study the cosmos 
and drink. We tune in to the microwave sky: 

to the froth at the edge of the universe. 

We sup in the dusk, everything glows 

with its own light: the hedgerow, lawn, 

the atoms inside the glass. The Milky Way 
sings in a half-inch of Guinness 

a song of distant past when the world

was a moment old. We gather it all in our mugs, 
in a pub garden on the edge of the moors, 
looking down on Jodrell Bank: Queen 

of the red-light district, cocking her huge lug 
to the mayhem beyond our patch. 

The bats are in on it, hunting in ultrasound, 
catching moths in their fangs, while frogs 

bark in the meadows, one to the other, 

a vast unfathomable love-song. I finish my pint
and add my glass to the phalanx: the more 
we drink 

the clearer we see, as any old soak will tell you. 
I tip back my head to look at the Pleiades

and tumble, arse over tit, into the damp grass. 
I lie in my cups under the bling of the
northern sky. 

I can hear it now, I can see it all clearly,

all and nothing, just the whole sky blazing. 


Poets paired with astronomers have created 

16 new poems, including this one, that form 
part of a collection of OO poems about the 
wonder of the Universe.


Dark Matter: Poems of Space 

Edited by Maurice Riordan and 

Jocelyn Bell Burnell 

Calouste Gulbenkian Foundation: 2008. 
208 pp. £8.50 
The pursuit of progress by thinkers and
artists became more a wait for a divine spark of 
inspiration than the steady toil of uncovering 
that had been accepted by their predecessors. 
And any means by which that spark could be 
nurtured was embraced. 

To guide readers through the science and 
culture of this period, Holmes masterfully dips 
in and out of the life of Joseph Banks. He is 
an inspired choice. As a curly haired 26-year- 
old, Banks was aboard HM Bark Endeavour on 
the momentous day in April 1769 when it first 
glided into sight of Tahiti. Ostensibly Banks 
was just the expedition’s plant collector, but he 
soon became more fascinated by the island’s 
human inhabitants. 

For most of the British crew — young men 
who had been away from female company 
for long months — the explorations in Tahiti 
were of one sort only, with the initial going rate 
being one ship’s nail for one sexual encoun- 
ter. That rate soon changed — as Holmes 
describes with gentle skill, the Tahitians were 
quick to grasp the workings of the market 
economy, and had a keen eye for the other useful
metal objects aboard the ship. Hyperinflation
set in, and at one point “there was a crisis
when one of the Endeavour’s 
crew stole a hundredweight 
bag of nails, and refused to 
reveal its whereabouts even 
after a flogging”. 

Banks looked further. He 
took up Tahitian mistresses 
too, but systematically 
recorded the local language, 
studied their religious 
systems, and even hinted 
at the true functional sig- 
nificance of native actions 
that, at first, seemed to be 
merely bizarre. Within a 
few months he had helped 
set the stage for the modern 
science of anthropology. 


Portrait of Joseph Banks in honour 
of astronomer William Herschel. 


Back in London, Banks's insights, energy and inherited
wealth eventually led to him becoming president of the
Royal Society. From his headquarters, as Holmes gracefully
phrases it, “his gaze swept steadily round the globe like
some vast, enquiring lighthouse beam”. His own days of direct
discovery were over, but couldn't other like-minded
individuals be encouraged to carry on such wondrous,
intense investigations?

One of the young men Banks chose to support was Humphry
Davy: friend of the Romantic poets and, in his quick, intense
creation of a safe coal-miner’s lamp in response to
underground disasters, a man who made himself into a perfect
exemplar of the new, Romantic style of discovery. Davy hurried
to the mines, spent intense weeks with the miners and then
took himself off to an isolated lab where, using his unique
genius, he cracked the problem.

Another of Banks's protégés was William Herschel, the
immigrant Hanoverian astronomer. Herschel is most famous now
for having measured the orbit of Uranus and establishing
this body as a planet, which almost became known as planet
George to honour King George III. But Herschel also worked
out the shape of the Milky Way and the Sun’s off-centre
position in it, and he discovered infrared radiation.

Although his work relied on the dull accumulation of
observational facts, it was the role of sudden insight and
genius that Herschel and others emphasized in their written
accounts. In Herschel’s case, it was true to his character: he
had risen in society by transforming his own life and moving
to England. Wouldn't he imagine there were fresh realms (new
planets, stars beyond our Solar System, light beyond the
visible spectrum) to uncover in nature as well?

Our notion of earlier scientists, including Newton, was
rewritten to fit this Romantic view. The young poet William
Wordsworth, for example, had famously devalued science with
his harsh line that when we probe the mechanism of a natural
process, “we murder to dissect”. But decades later, as the
work of Banks and his protégés became better known,
Wordsworth shifted to admire science, seeing Newton as the
archetypal Romantic hero: “Voyaging through strange seas of
Thought, alone”.

Over time, the simplest Romantic imagery slid away. Few of
the hundreds of physicists working today at CERN, the
European particle-physics lab near Geneva, Switzerland, must
view themselves as voyaging anywhere alone. But the
Romantic era left great legacies. Newton's static 
Universe was gone, and Herschel’s dynamic one 
— in which stars evolved and the heavens were 
ever-changing — was in its place. This muta- 
ble view was useful for the next notable young 
man, one Charles Darwin, when he was ready 
to embark on his own voyage of discovery on 
HMS Beagle in 1831. And even among today’s 
doctoral candidates at CERN, who doesn't 
hope that maybe, with enough concentration, 
something very special and very unique could 
still burst out?
David Bodanis is a writer based in London, and 
author of Electric Universe. His forthcoming book 
is on the Ten Commandments. 
Hidden treasures: the Paris Observatory


Giovanni Domenico Cassini helped to create an institution that pinpointed Neptune, showed that 
light had a finite speed — and even mapped France, explains Alison Abbott. 


Giovanni Domenico Cassini 
was a mathematical genius 
with finely tuned social skills, 
a quick wit and a habit of 
enthralling powerful political 
figures. In 1650, when Cassini 
was just 25, the Pope appointed him profes- 
sor of astronomy at his university in Bologna, 
Italy. He gave Cassini the fattest salary there, 
to the chagrin of his more established col- 
leagues, and called on him to help resolve 
some of the highest issues of state. These var- 
ied from escorting Sweden’s Queen Chris- 
tina to Rome when she wished to convert to 
Catholicism — a brief romance apparently 
sparked between them during the long jour- 
ney — to determining the direction of water 
flow in the Apennine mountain range that 
bordered the papal states, to designing papal 
fortifications. 

Meanwhile, France’s Louis XIV, the Sun 
King, was hatching grandiose plans for his 
country — among them an academy of sci- 
ences and arts and an observatory for experi- 
mental sciences, particularly astronomy. He 
commissioned his minister, Jean-Baptiste 
Colbert, to scour Europe for the best scien- 
tists and artists. In 1668, Colbert made Cas- 
sini a financial offer he could not refuse. With 
the indulgence of the Pope, who was keen to 
maintain good relations with the French king 
during a time of political and religious ten- 
sions, Cassini set off immediately for Paris. 
There he joined the fledgling academy and 
served as royal adviser on the building of the 
observatory. It was supposed to be a temporary 
visit. The university in Bologna griped about
the Pope's insistence on continuing to pay him,
even when it became clear Cassini wouldn't
return. The Pope turned a deaf ear. 

Louis XIV loved Cassini from the beginning. 
Although not formally the first director of the 
observatory, which was completed in 1682, 
Cassini assumed the role in practice. Just five years
after he arrived, the king bestowed French 
citizenship on him. 

His privileged relationship with Louis XIV
was as much a source of annoyance to some
Parisian colleagues as his relationship to
the Pope had been in Bologna. But Cassini,
with his large personality and outstanding
scientific achievements (ranging from the
discovery of a gap in the rings of Saturn to
unveiling the structure of snowflakes), was
a major influence on the development of
French science.

Cassini relied on an artist to depict his Moon views.

The king was reportedly less pleased with 
Cassini's great expedition in the early 1670s. 
Together with Jean-Felix Picard and Philippe 
de La Hire, he remapped France using astron- 
omy, ingeniously exploiting Galileo’s 1610 
discovery of four satellites of Jupiter. They 
used the eclipses of these satellites as time ref- 
erence points to measure longitude differences 
between locations on the coasts of France. The 
resultant map showed France to be narrower 
than previously thought, with the kingdom 
losing 20% of its surface area. Poor recom- 
pense, the king said, for the solicitude he had 
shown his astronomers. 
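
The arithmetic behind the method is simple enough to sketch. Earth turns through 360 degrees in 24 hours, so observers at two sites who note the local time of the same eclipse of a Jovian satellite can convert the difference between their clocks into a difference in longitude. The short Python sketch below is only an illustration of that principle: the function name and the example clock readings are hypothetical and are not taken from the surveyors' records.

```python
# Earth rotates 360 degrees in 24 hours: 15 degrees of longitude per hour.
DEGREES_PER_HOUR = 360.0 / 24.0


def longitude_difference_deg(local_time_a_h, local_time_b_h):
    """Longitude of site B relative to site A, in degrees.

    Both arguments are the local solar times (in decimal hours) at which
    the same eclipse was observed. A positive result means B lies east
    of A; a negative result means B lies west of A.
    """
    diff_hours = local_time_b_h - local_time_a_h
    # Wrap into the range [-12, 12) so readings either side of midnight work.
    diff_hours = (diff_hours + 12.0) % 24.0 - 12.0
    return diff_hours * DEGREES_PER_HOUR


# Hypothetical readings: the eclipse is seen at 21.0 h at site A
# and at 20.5 h at site B, so B lies to the west of A.
print(longitude_difference_deg(21.0, 20.5))
```

In practice the hard part was not this conversion but keeping accurate local time at each site and timing the eclipse itself.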

The observatory was built in the country- 
side, some way from the centre of Paris. Louis 
XIV had fondly imagined it would be the acad- 
emy’s meeting place, but the scientists found 
that it took too long to get there and contin- 
ued to meet in town. Today it has been swal- 
lowed up by the city, sheltered from the roar 
of traffic behind walls that encircle generous 
gardens. Visitors will appreciate why Cassini
argued, unsuccessfully, with the architects in 
1668: the space is hardly optimal for scientific 
endeavour. It looks more like a palace than a 
place of work. 

But it resounds with history. Vast knowl- 
edge was accumulated here, from the 1676 
discovery that the speed of light was finite, to 
the accurate pinpointing of Neptune's posi- 
tion using the observed perturbations of the 
orbit of Uranus. The high-ceilinged, cavernous 
rooms are now ideal for displaying the massive 
wood-and-brass astronomical instruments that 
form much of the collection of the observa- 
tory’s museum, created in 1878. Although 
by then some of the oldest instruments had 
been lost during expeditions, more than 500 
objects remain, including the telescope of Léon 
Foucault — more famous for his pendulum — 
and the first to use the thin, silver-plated mir- 
rors found in modern telescopes. 

The library, founded in 1785, has also 
benefited from major acquisitions. During 
and just after the French Revolution in the 
1790s, Cassini’s great-grandson donated his 
forebear’s library and extensive records of 
astronomical observations, as well as those 
of the three Cassini generations who sub- 
sequently directed the observatory — their 
dynasty lasted for more than a century. Then, 
the eighteenth-century astronomer Joseph- 
Nicolas Delisle sold the library a vast number 
of astronomy manuscripts that he had 
assiduously collected during his decades of 
travels around Europe and Russia. Most val- 
uable are the 16 boxes of manuscripts by the 
great Polish astronomer Johannes Hevelius, 
including all his correspondence. 

The library’s greatest treasures are perhaps 
the original drawings of the surface of the 
Moon (as pictured) made by Cassini's engraver, 
who sat beside the astronomer as he made his 
observations. For all his attributes, Cassini was 
not a gifted artist. These shaded pastel drawings,
made under his guidance and annotated
in his own hand, have rarely been seen. Those 
fortunate enough to examine them will experience
a rush — such a direct emotional connection
with history is rare, and powerful.
Alison Abbott is Nature's senior European 
correspondent. 


See http://tinyurl.com/6yxa68 for details. 
See Editorial, page 7, and www.nature.com/astro09. 
QUANTUM PHYSICS


Squeeze until it hurts 


Geoff J. Pryde 


Quantum systems are uncertain by nature. By ‘squeezing’ this uncertainty, physicists can make better 
measurements of quantities such as distance. But overdoing it makes things burst out all over the place. 


At the leading edge of experimental science, 
the latest measurement techniques are prom- 
ising to provide breakthroughs in our under- 
standing of the Universe. The ever-improving 
ability to sense small displacements, for exam- 
ple, is at the heart of projects such as the Laser 
Interferometer Gravitational Wave Observatory
(LIGO)¹, which seeks to observe the faint
space-time ripples of distant supernovae. 
When technical noise is strongly suppressed, 
the ultimate limit to the precision of any meas- 
urement is set by the quantum uncertainty in 
the measuring system. But even this quantum 
uncertainty can be reduced — a technique 
known as ‘squeezing’. On page 67 of this issue,
Shalm et al.² show that squeezing down this
quantum uncertainty is not as simple as might 
be expected — too much squeezing actually 
worsens measurement precision. Fortunately, 
they also show that it is possible to recover the 
best precision allowed by the laws of physics 
by looking at the ‘over-squeezed’ system in a 
different way. 

That a fundamental limit to measurement 
precision exists at all is a purely quantum 
phenomenon. Consider light, the basis of a 
suite of sensitive interferometric measurement 
techniques. In the classical picture, light is a 
wave whose amplitude and phase — where the 
wave's peaks and troughs lie — can be speci- 
fied with infinitesimal precision. But in reality, 
light has much more character. It is made up of 
indivisible photons that exhibit probabilistic 
behaviour when forced to decide which quan- 
tum state, out of a range of options presented, 
to be in. 

Shine a hypothetical classical beam on a 
beam splitter that reflects 50% and transmits 
50% of the light, and you know that exactly half 
goes each way. Put a grainy beam — composed
of N photons, say — on that same beam split- 
ter and it is exactly like tossing a coin N times. 
You know that on average each outcome should 
occur N/2 times, but there is some uncertainty 
because, in a given trial, you might get more 
heads than tails, for instance. 
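
The coin-toss picture can be checked with a short simulation. The sketch below is illustrative only: the function names, the photon number and the number of trials are assumptions, not values from the article. For a 50:50 beam splitter the transmitted count follows a binomial distribution with mean N/2 and standard deviation √N/2, so the relative uncertainty shrinks as 1/√N.

```python
import math
import random


def expected_spread(n_photons: int) -> float:
    """Standard deviation of the transmitted count for a 50:50 beam splitter."""
    # Binomial distribution with p = 1/2: variance = N * p * (1 - p) = N / 4.
    return math.sqrt(n_photons) / 2


def simulate_transmitted(n_photons: int, rng: random.Random) -> int:
    """Send photons one at a time; each is transmitted with probability 1/2."""
    return sum(1 for _ in range(n_photons) if rng.random() < 0.5)


def sample_spread(n_photons: int, trials: int, seed: int = 0) -> float:
    """Estimate the spread of the transmitted count over many repeated trials."""
    rng = random.Random(seed)
    counts = [simulate_transmitted(n_photons, rng) for _ in range(trials)]
    mean = sum(counts) / trials
    variance = sum((c - mean) ** 2 for c in counts) / (trials - 1)
    return math.sqrt(variance)


if __name__ == "__main__":
    n = 400  # hypothetical photon number, chosen so that sqrt(N)/2 = 10
    print("predicted spread:", expected_spread(n))
    print("simulated spread:", sample_spread(n, trials=2000))
```

The simulated spread should land close to the predicted value, which is the uncertainty that squeezing aims to push below.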

Figure 1 | Quantum uncertainties. Conceptual
representation of the quantum state of light: the
length of the red stick represents the amplitude
of the light, and its angle φ represents the phase.
The ‘blob’ represents the quantum uncertainty
in knowing the amplitude and phase. The blob is
usually a Gaussian probability distribution, and
the axes E₀ and E₉₀ represent the quadratures of
the electric field E at frequency ω: classically,
E = E₀ cos(ωt) + E₉₀ sin(ωt). a, An unsqueezed
state; b, a squeezed state. In this example, the
amplitude uncertainty is squeezed (decreased) and
the phase uncertainty is anti-squeezed (increased).

A picture may help to understand squeezing.
Imagine a stick lying in a plane (Fig. 1a) such
that its length represents the amplitude of the
light and its angle represents the phase. The
probabilistic nature of photons smears out the 
end of the stick, leading to uncertainty in the 
measurement of phase and amplitude. How- 
ever, quantum theory tells us we can squeeze 
this uncertainty ‘blob’ in one direction and 
it will expand in another direction, just like 
a balloon filled with water. So, squeezing the 
uncertainty in amplitude leads to an increased 
phase uncertainty (Fig. 1b) — but that’s all right 
if it is just the amplitude that we are trying to 
measure precisely. 

Squeezing itself³ has been demonstrated
many times in a regime in which squeezing 
harder just makes things better and better. 
But it has been predicted that, in systems with
limited dimensions, squeezing harder can make 
things worse⁴. Shalm et al.² have been able to
investigate this fascinating situation experi- 
mentally by confining themselves to a quan- 
tum system composed of just three photons, 
which they mash together into the same region 
of time, space and frequency spectrum. 

The quantity that Shalm and colleagues 
measure is not amplitude or phase, but rather 
polarization — a property that describes 
whether the electric-field vector of the light 
oscillates vertically, horizontally, in a circle, 
or any combination of the above. These three- 
photon polarization states are not represented 
by a stick in a plane, but rather by a stick going
from the centre of a sphere to the surface, and 
the quantum uncertainty is represented by a 
blob on that surface (Fig. 2a, overleaf). It is 
now apparent what happens when squeezing 
occurs: although the uncertainty is reduced in 
one direction, the blob starts to wrap around 
the sphere in the other (Fig. 2b). Eventually 
it goes all the way around and meets up — 
although the three-fold symmetry leads to a 
rather more interesting pattern than one might 
initially expect (Fig. 2c). This over-squeezing 
is problematic, however. Rather than reducing 
the uncertainty in the specified direction, the 
over-squeezing has spread the initial uncer- 
tainty into three blobs equally spaced around 
a ring, meaning that it is now even more uncertain
where the end of the stick is — that is to
say, what the light’s polarization is. 

Does this mean that there is an optimal 
amount of squeezing — not too little and not 
too much? If one is limited to detecting all the 
photons in one bunch (an intensity measurement),
the answer is ‘yes’. But when the photons
were counted one by one, Shalm et al. were
able to reveal the correlations between them 
that give rise to the three fringes around the 
equator. As this maximally squeezed state is 
identical to the highly entangled NOON state 
proposed for quantum metrology⁵, there exist
known adaptive-measurement algorithms 
to extract the complete information in an 
efficient way⁶.

So where does all of this leave us? Shalm 
and colleagues have elegantly demonstrated 


50 YEARS AGO 

The largest and heaviest
satellite so far placed in orbit
by the United States was
launched in a south-easterly
direction from Cape Canaveral
in Florida at 23.02 U.T. on
December 18. The complete
satellite weighs some 8,500 lb.,
of which only a small proportion,
about 150 lb., is pay-load. The
remainder of the object is the
empty shell of the Atlas missile
which performed the launching,
about 80 ft. long and 10 ft. in
diameter. The main purpose
of the satellite is to act as a
radio relay station, storing and 
re-transmitting, on frequencies 
of 132.45 and 139.905 Mc./s., 
messages sent to it from the 
ground. It also carries telemetry 
transmitters operating on 107.97 
and 107.94 Mc./s. The satellite 
is the first to be successfully 
launched since July 1958, and is 
designated 1958 ζ.

From Nature 3 January 1959. 


100 YEARS AGO 

Fads and Feeding. By Dr. C. 
Stanford Read — This is an 
admirably clear, well reasoned, 
and sensible little book. One 
can only hope that it may
be widely read and may do
something to counteract the
ridiculous views on diet which
are the result of the cogitation
of that dangerous class of
people who, having a little
knowledge, supply the remainder
from their own imagination.

In spite of the importance of a
suitable diet for health, there
is perhaps no other subject
which breeds so many fads...
The key-word of Dr. Read's book
is moderation; moderation in
meat-eating, in tea-drinking,
in the use of alcoholic drinks
and the like. He is also moderate 
in the way he deals with the 
faddists, the vegetarians, the 
uric-acid-free dietists, the 
teetotalers, and the rest. 
Perhaps this method of dealing
with them is the most effectual
with the public, who, taken as

Figure 2 | Squeezing a triphoton. The position of the quantum state on the surface of a sphere — seen
here in two-dimensional projections — describes the polarization of a three-photon (triphoton) state. 
The directions S₁, S₂ and S₃ represent the degree of horizontal, 45° and right-circular polarizations,
respectively. The colours correspond to different levels of probability for the polarization state (red 
for highest probability, blue for lowest); the red and yellow regions represent the quantum uncertainty 
blob. The uncertainty in the S, direction is reduced as the squeezing is increased (a-c). a, An 
unsqueezed state. b, As the squeezing is increased, the uncertainty in the S, direction is decreased at 
the expense of increased uncertainty in the S, direction. c, Shalm et al.² show that in the over-squeezed
case the uncertainty blob completely wraps around the sphere. Although the regions of uncertainty 
are small in the S, direction, there are three of them — as shown in an alternative projection (right) — 
leading to a large overall uncertainty. (Modified from ref. 2.) 
CANCER


Inflaming metastasis 


Alberto Mantovani 


Cancer can be defined by six hallmarks, including uncontrollable growth, 
immortality and the ability to invade other tissues. Increasing evidence 
suggests that a seventh feature should make this list — inflammation. 


Malignant tumours are characterized by 
their ability to metastasize, that is, to invade 
anatomically distant normal tissues and to 
seed and grow there. During this complex 
and highly selective process, tumour cells leave 
their primary site and disseminate by various 
routes, such as the blood and lymph vessels. 
Not all cancer cells can metastasize¹, because
successful metastasis depends both on intrinsic 
properties of the tumour cells and on factors 
derived from the tumour microenvironment. 
For example, the microenvironment provides 
blood and lymphatic vessels in and around the 
tumour, an inflammatory milieu consisting
of immune cells and their secretory products, 
and a scaffold in the form of the extracellu- 
lar matrix for further growth. Writing in this 
issue, Kim et al.² shed light on the unexpected
molecular pathways that link inflammation in 
the tumour microenvironment to metastasis 
(page 102). 

The link between inflammation and cancer 
is well documented³,⁴. Several inflammatory
diseases, including inflammatory bowel dis- 
ease, increase the risk of cancer. Conversely, in 
tumours that are epidemiologically unrelated 
to overt inflammatory conditions
(such as breast cancer), the activation 
of oncogenes can orchestrate the pro- 
duction of inflammatory molecules 
and the recruitment of inflammatory 
cells. In the tumour microenviron- 
ment, inflammatory cells and 
molecules influence almost 
every aspect of cancer progress, 
including the tumour cells’ ability
to metastasize’. Thus, whereas
there were previously six recognized 
hallmarks of cancer — unlimited 
replicative potential, self-suffi- 
ciency in growth signals, insensi- 
tivity to growth inhibitors, evasion 
of programmed cell death, ability 
to develop blood vessels, and tissue 
invasion and metastasis⁵ — cancer-related
inflammation now emerges
as number seven (Fig. 1). 

A group of cytokine proteins, 
including IL-1, IL-6, TNF and 
RANKL, activate inflammation and 
are known to augment tumour cells’ 
ability to metastasize by affecting 
several steps in the cells’ dissemina- 
tion and implantation at secondary 
sites”®’. Inflammatory cytokines 
lie downstream of the ‘master’ gene 
transcription factor for promoting
inflammation — NF-κB — which is itself
activated by them’. A major source of inflam- 
matory cytokines in the tumour microen- 
vironment are specialized white blood cells 
called macrophages. Tumour-associated 
macrophages assist the malignant behav- 
iour of tumour cells, not just by producing 
cytokines, but also by secreting growth factors 
and matrix-degrading enzymes*”. 

Kim et al.² explored the molecular pathways
linking tumour cells, macrophages and
metastasis. By purifying the components of the 
medium in which the tumour cells (the Lewis 
lung carcinoma cell line) were grown, they 
isolated a factor that induced cytokine pro- 
duction by macrophages. They identified this 
tumour-derived macrophage activator as ver- 
sican, a protein of the extracellular matrix that 
is frequently upregulated in human tumours. 
The authors found that versican is recognized 
by TLR2 and TLR6, two receptor proteins 
that belong to a family of cellular sensors of 
microbially derived molecules and tissue dam- 
age. They then went on to silence versican in 
tumour cells by an RNA interference technique, 
and to use mice in which TNF and TLR were 
absent. On the basis of the evidence obtained, 
the authors propose that, in the Lewis lung car- 
cinoma model, tumour-derived versican acts 
on macrophages through TLR2/TLR6, leading 
to the production of inflammatory cytokines, 
which enhance metastasis. 

Figure 1 | The hallmarks of cancer. In 2000, Hanahan
and Weinberg⁵ proposed a model to define the six
properties that a tumour acquires. These are unlimited
replicative potential, ability to develop blood vessels
(angiogenesis), evasion of programmed cell death
(apoptosis), self-sufficiency in growth signals, insensitivity
to inhibitors of growth, and tissue invasion and metastasis.
Kim and colleagues’ findings², together with those of
other studies*’, indicate that this model should be revised
to include cancer-related inflammation as an additional
hallmark. (Adapted from ref. 5.)

Kim and colleagues’ observations highlight
the importance of the extracellular matrix in
cancer-related inflammation. The matrix acts
as a depot of cytokines and growth factors, in
particular vascular endothelial growth fac- 
tor, which is mobilized by enzymes originat- 
ing from inflammatory white blood cells and 
which promotes blood-vessel formation dur- 
ing tumour progression*”. Moreover, during 
the development of cancers caused by human 
papillomavirus, immune cells called B cells
orchestrate inflammation remotely
by producing antibodies that become 
localized in the extracellular matrix". 
What's more, a macrophage-derived 
extracellular-matrix protein called SPARC 
facilitates tumour-cell motility and meta- 
stasis’”. So it seems that extracellular- 
matrix components are much more than 
a scaffold, or a substrate to be consumed 
during tumour-cell invasion, but instead 
represent a central component of cancer- 
related inflammation. 

The present study² offers unexpected
vistas on the molecular pathways that link 
inflammation to acquisition of the capacity 
to metastasize during tumour progression. 
It will be essential to assess the significance
of versican and other extracellular-matrix
proteins in models that reflect the diversity of 
human cancer, for from such work innovative 
therapeutic strategies may follow.
Alberto Mantovani is at the Istituto Clinico 
Humanitas IRCCS, and the University of Milan, 
Rozzano, Milan 20089, Italy.

e-mail: alberto.mantovani@humanitas.it
ASTROPHYSICS


Star formation branches out 


Ralph E. Pudritz 


Deciphering how stars form within turbulent, dense clouds of 
molecular gas has been a challenge. An innovative technique that 
uses a tree diagram provides insight into the process. 


An understanding of how stars form has been 
hampered by the complexity of the clouds 
of cold molecular gas within which their 
formation occurs. To elucidate this process, 
the effects of gravity must be traced across a 
wide range of scales, particularly at the large 
distances over which it operates in these gas 
clouds. On page 63 of this issue, Goodman 
et al.¹ show how a hierarchical tree diagram —
a ‘dendrogram’ — can be used to disentangle 
the gravitational connections that tie the gas 
together on many different scales. 

The movements of gas in molecular clouds 
are measured by spectral shifts. Depending on 
whether a parcel of gas is moving towards or 
away from the observer, the wavelength of the
millimetre emission from a molecule such as 
carbon monoxide is shifted towards shorter 
(‘blueshifted’) or longer (‘redshifted’) wave- 
lengths. By measuring these Doppler shifts at 
each point on the sky, one can determine the 
relative velocities of parcels of gas in the cloud 
along each line of sight. It turns out that the 
gas motions in such clouds are mainly highly 
supersonic. Indeed, computer simulations 
show² that the network of dense filamentary
structures seen in clouds is probably a direct 
consequence of such supersonic gas flows 
(Fig. 1). 
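
The conversion from a measured wavelength shift to a line-of-sight velocity can be sketched in a few lines. This is a minimal illustration using the non-relativistic Doppler formula, which is adequate for the few-kilometres-per-second motions typical of such gas. The function name and the example numbers are assumptions for illustration; the rest wavelength of roughly 2.6008 mm corresponds to the carbon monoxide J = 1-0 line near 115.27 GHz.

```python
C_KM_S = 299_792.458  # speed of light in km/s


def radial_velocity_km_s(observed_mm: float, rest_mm: float) -> float:
    """Line-of-sight velocity from v = c * (observed - rest) / rest.

    Positive values mean the gas is receding (redshifted);
    negative values mean it is approaching (blueshifted).
    """
    return C_KM_S * (observed_mm - rest_mm) / rest_mm


if __name__ == "__main__":
    rest = 2.6008  # mm, approximate CO J = 1-0 rest wavelength (assumed value)
    for shift in (-2e-5, 0.0, 2e-5):  # hypothetical fractional shifts
        observed = rest * (1 + shift)
        print(f"shift {shift:+.0e}: {radial_velocity_km_s(observed, rest):+.2f} km/s")
```

Repeating this at every position on the sky gives the velocity axis of the maps described in the text.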

As with many astronomical observations, 
however, we do not know the distance of any
object (gas in this case) from Earth without
using further painstaking methods. In observing
molecular clouds, one is limited to measuring
the two-dimensional position of the total
gas emission on the sky. This measurement
involves two coordinates that are akin to latitude
and longitude on Earth’s surface, as well
as the relative velocity of gas at that position.
Thus, a map of such a cloud is a sequence of
position-position-velocity (p-p-v) measurements
of the gas across the whole cloud. But
without the ability to observe the full, three-dimensional
gas cloud, how can its true structure
be deduced, let alone the strength of the
various forces that control where and when
stars will form?

Figure 1 | Stellar nursery. The image shows a computer simulation² of the formation of stars within
a turbulent, self-gravitating cloud of gas. The initial mass of this star-cluster-forming cloud, which is
modelled as a sphere of uniform density, is 500 solar masses. The sphere’s radius is 83,300 astronomical
units (1 AU is the mean distance between Earth and the Sun) and its temperature is 10 kelvin. Supersonic
turbulence compresses the gas into many filaments and smaller, dense regions. The simulation is viewed
after one free-fall time — the time taken for a gas parcel to collapse freely to the cloud centre — has
elapsed, which is 1.9 × 10⁵ years for these simulation parameters. The white dots correspond to small,
dense gas ‘cores’ that collapse to form individual stars. These would correspond physically to regions
such as those denoted by the billiard balls in Figure 1 of the paper by Goodman et al.¹ (page 63).
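
The free-fall time quoted in the Figure 1 caption can be reproduced from the stated mass and radius. The sketch below assumes the standard expression for a uniform, pressure-free sphere, t = sqrt(3π / (32 G ρ)); the function name and the rounded physical constants are choices made for this illustration.

```python
import math

G = 6.674e-11      # gravitational constant, m^3 kg^-1 s^-2
M_SUN = 1.989e30   # solar mass, kg
AU = 1.496e11      # astronomical unit, m
YEAR = 3.156e7     # one year, s


def free_fall_time_years(mass_msun: float, radius_au: float) -> float:
    """Free-fall time of a uniform sphere: t = sqrt(3*pi / (32*G*rho))."""
    mass = mass_msun * M_SUN
    radius = radius_au * AU
    density = mass / (4.0 / 3.0 * math.pi * radius ** 3)
    return math.sqrt(3.0 * math.pi / (32.0 * G * density)) / YEAR


if __name__ == "__main__":
    # Cloud parameters from the caption: 500 solar masses, 83,300 AU radius.
    print(f"{free_fall_time_years(500, 83_300):.2e} years")
```

With the caption's parameters this gives a value close to 1.9 × 10⁵ years.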

The approach usually taken to tackle 
this problem consists of segmenting the 
clumpy cloud into a collection of structures 
(clumps) using a computer program called 
CLUMPFIND. The end result is analogous to a 
topographical relief map of a mountain range. 
Such a map typically shows peaks that stand 
out from ridges or are isolated, and provides
a series of contours that demarcate different 
elevations. If one now decided to break the 
range up into discrete ‘mountains’, one would
identify the peaks and, using the various 
contour levels, try to decide whether smaller 
outcrops ‘belonged’ to a given mountain or 
were independent structures. In a p-p-v map
of a molecular cloud, it is the contours and 
peaks of gas-column density — the sum of the 
emission from all gas parcels moving at a given 
velocity along a given line of sight — that play 
the role of topographical relief in this mountain-range
analogy.

The problem with this approach arises as 
soon as the results are used to try to provide 
insight into how stars form. For example, the 
column densities allow one to measure the 
masses of the clumps. One can then count 
the number of clumps with a given mass. 
The resulting distribution of clump masses 
is used to work out how star formation 
might occur³,⁴. With programs similar to
CLUMPFIND, the data suggest that the 
clump-mass distribution closely resembles
the distribution of star masses. A plausible but 
debated inference is that the origin of stellar 
masses may derive directly from the turbulent 
process that produced the clumps. 

The fly in this ointment, however, as Goodman
et al.¹ show (see Fig. 1 of their Supplementary
Information), is that if one adopts different
threshold levels for contouring the maps, the 
column-density distribution changes. This is 
similar to the situation in a topographical relief 
map of a mountain range: the list of discrete 
mountains and their properties depends on 
the choice of threshold level picked to define 
mountains and smaller outcrops. This is unset- 
tling — such an approach does not provide a 
completely objective way of measuring the 
actual distribution of column densities. 

Enter the dendrogram technique advocated
by the authors¹. Rather than dividing the p-p-v
data into a priori distinct structures in a sub- 
jective way, they use a method that is sensitive 
to the structures’ intrinsic hierarchy (struc- 
ture within structure). The data are broken up 
into ‘leaves’, ‘branches’ and ‘trunks’. Leaves are
identified as sufficiently strong maxima in the 
column-density maps, and connections 
between them are made by branches (their 
environs). The collection of physically related 
branches defines a trunk. 
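
The leaf, branch and trunk idea can be illustrated with a toy one-dimensional version. The sketch below is not the authors' implementation: it works on a single intensity profile instead of a p-p-v cube, and every name in it (`Structure`, `build_dendrogram`, `min_delta`) is hypothetical. It lowers a contour level through the data; each new local maximum starts a leaf, and when two structures meet they are joined under a new branch. The `min_delta` argument stands in for "sufficiently strong": a leaf that rises less than `min_delta` above the level at which it merges is absorbed instead of being kept.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Sequence


@dataclass
class Structure:
    idx: int
    peak: float                                  # brightest value it contains
    children: List[int] = field(default_factory=list)
    parent: Optional[int] = None
    pruned: bool = False                         # absorbed as an insignificant bump


def build_dendrogram(values: Sequence[float], min_delta: float = 0.0) -> List[Structure]:
    structures: List[Structure] = []
    owner: Dict[int, int] = {}                   # pixel index -> structure index

    def root(s: int) -> int:
        # Follow parent links up to the current top-level structure.
        while structures[s].parent is not None:
            s = structures[s].parent
        return s

    # Visit pixels from brightest to faintest, like lowering a contour level.
    for i in sorted(range(len(values)), key=lambda k: -values[k]):
        touching = sorted({root(owner[j]) for j in (i - 1, i + 1) if j in owner})
        if not touching:
            # No claimed neighbour: a new local maximum, so start a leaf.
            structures.append(Structure(len(structures), values[i]))
            owner[i] = structures[-1].idx
        elif len(touching) == 1:
            # Extends a single existing structure.
            owner[i] = touching[0]
        else:
            a, b = touching
            weak = [s for s in (a, b)
                    if not structures[s].children
                    and structures[s].peak - values[i] < min_delta]
            if weak:
                # Absorb the fainter insignificant leaf into its neighbour.
                loser = min(weak, key=lambda s: structures[s].peak)
                keeper = b if loser == a else a
                structures[loser].parent = keeper
                structures[loser].pruned = True
                owner[i] = keeper
            else:
                # Two significant structures meet: join them under a branch.
                branch = Structure(len(structures),
                                   max(structures[a].peak, structures[b].peak),
                                   children=[a, b])
                structures.append(branch)
                structures[a].parent = branch.idx
                structures[b].parent = branch.idx
                owner[i] = branch.idx
    return structures


def leaves(structures: List[Structure]) -> List[Structure]:
    return [s for s in structures if not s.children and not s.pruned]


def trunks(structures: List[Structure]) -> List[Structure]:
    return [s for s in structures if s.parent is None]


if __name__ == "__main__":
    profile = [0, 1, 5, 2, 6, 1, 0, 4, 0]        # hypothetical column densities
    tree = build_dendrogram(profile, min_delta=0.0)
    print("leaf peaks:", sorted(s.peak for s in leaves(tree)))
    print("number of trunks:", len(trunks(tree)))
```

Unlike a single-threshold segmentation, the result records how structures nest at every level, so no one contour value has to be chosen in advance.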

Every point on the dendrogram corresponds 
to a closed ‘isosurface’ on the map that encloses
one or more column-density maxima (see 
Fig. 2 on page 64). The authors then show how 
physical properties can be ascribed to regions 
within these isosurfaces. A critical property is 
the ‘virial’ parameter — the ratio of the energy 
of the gas motions (which depends on the gas 
velocity in the region) to the gravitational 
energy (which depends on the mass and size 
of the region). If this ratio is sufficiently small, 
the gas in that region will be self-gravitating 
and prone to form stars. The dendrogram thus 
traces the relative strength of the gravitational 
force across the gas cloud. 
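
The energy ratio described above can be evaluated for a simple idealized region. The sketch below assumes a uniform sphere with a one-dimensional velocity dispersion σ, for which the kinetic energy is (3/2)Mσ² and the gravitational energy has magnitude (3/5)GM²/R. These modelling choices, the function name and the example numbers are assumptions for illustration; the article does not specify the exact definition or threshold used by the authors.

```python
G = 6.674e-11      # gravitational constant, m^3 kg^-1 s^-2
M_SUN = 1.989e30   # solar mass, kg
PC = 3.086e16      # parsec, m


def kinetic_to_gravitational(mass_msun: float, radius_pc: float,
                             sigma_km_s: float) -> float:
    """Ratio of kinetic to gravitational energy for a uniform sphere."""
    mass = mass_msun * M_SUN
    radius = radius_pc * PC
    sigma = sigma_km_s * 1.0e3                       # km/s -> m/s
    kinetic = 1.5 * mass * sigma ** 2                # (3/2) M sigma^2
    gravitational = 0.6 * G * mass ** 2 / radius     # (3/5) G M^2 / R
    return kinetic / gravitational


if __name__ == "__main__":
    # Hypothetical region: 100 solar masses, 0.5 pc radius, 0.5 km/s dispersion.
    ratio = kinetic_to_gravitational(100, 0.5, 0.5)
    print(f"energy ratio = {ratio:.2f}")
```

A ratio below one means the gravitational energy exceeds the energy of the gas motions, which is the sense in which a small value marks a region as self-gravitating.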

Application of the dendrogram technique 
to observations of the gas cloud L1448, and to 
computer simulations in which only turbulence 
— and not self-gravity — is taken into account,
shows inconsistencies. In contrast to the 
simulations, in which most of the gas is found 
to be self-gravitating on all spatial scales, the 
observations show that, although a large frac- 
tion of the gas is self-gravitating at large scales, 
at smaller scales that fraction is much lower. 

Interestingly, strong local column-density 
maxima — which correspond to the dense 
gas ‘cores’ in which stars are observed to form 
— are sparser in Goodman and colleagues’ 
dendrogram than those found with the 
CLUMPFIND algorithm, and turn out to be 
in larger regions of self-gravitating gas. And 
here we arrive at what may be the most tanta- 
lizing point of all. A debate that has enlivened 
star-formation theory for nearly a decade is 
how stars acquire their mass⁵. Do they accrete
their gas from relatively isolated cores, or do 
they accrete material as they move about in a
broader gravitational potential, gathering mass 
through competition with other dense regions 
in the same gravitationally bound region⁶ (the
‘competitive accretion’ picture)? Although 
neither hypothesis is amenable to definitive 
observational tests, the dendrogram method 
developed by Goodman et al. has the potential
to answer this question and to identify the real 
conditions in which stars form.
GAME THEORY


How to treat those of ill repute 


Bettina Rockenbach and Manfred Milinski 


A much-needed theoretical analysis deals with whether the principle known 
as ‘costly punishment’ helps to maintain cooperation in human society. It 
will prompt a fresh wave of experiments and theory. 


Human societies are built on cooperation, 
especially on reciprocation’ — I help you and 
you help me, or I help you and someone else 
helps me. In the first case, help is directly recip- 
rocated by help. In the second, called indirect 
reciprocity, I gain a good reputation and so I 
can expect help when in need. 

But how shall I treat someone with a bad 
reputation? Shall I just refuse help or shall I 
punish this person at a cost to myself? Costly 
punishment can enhance cooperation
in experiments with human subjects, but
potentially with no net benefit: the costs of
punishment usually, although not always,
neutralize gains from enhanced cooperation.
On page 79 of this issue, Ohtsuki et al. describe a
theoretical test of whether either refusing 
help to or punishing someone with a bad 
reputation might lead to a cooperative soci- 
ety. They conclude that, except under certain 
rare conditions, punishment does not produce 
that outcome. 

When you meet someone needing help, you 
can help (cooperate), refuse to help (defect) 
or not only refuse to help but, in addition, 
decrease the needy person’s wealth (punish). 
Both cooperation and punishment are costly 
for you, but respectively create a larger ben- 
efit or larger loss for the person needing help. 
Defection is cost neutral. 

How you behave depends on the reputation 
— good or bad — of the needy person, and 
depends upon your ‘action rule’. An example
is ‘cooperate with someone with a good rep-
utation and defect with someone with a bad
reputation’ (CD). The reputation you yourself
gain by applying your action rule depends on
the social norm of your society. Under the
norm ‘stern-judging’, for example, you gain
a good reputation when cooperating with good
or when defecting with bad, and a bad reputation
in all other cases. Thus CD always leads to a
good reputation under stern-judging. 
Another action rule, CP, prescribes ‘cooperate 
with good and punish bad’. Under stern- 
judging, with CP you will achieve a good repu- 
tation when you interact with someone with 
a good reputation and a bad reputation when 
you interact with someone with a bad reputa- 
tion. But under a different social norm, ‘shun- 
ning’ (cooperation with good or punishment of 
bad leads to a good reputation), CP will always 
provide you with a good reputation (Fig. 1). 
In their simplest model, Ohtsuki et al.
assume that everybody has the same opinion
of the reputation of another person or has
the same level of fallibility in assigning an
incorrect reputation. For such a society, they
test for each of the 64 different social norms 
(Fig. 1) whether an action rule exists that both 
generates a cooperative society and is evolu- 
tionarily stable — meaning one that resists 
replacement (invasion) by any of the other 
eight possible action rules (Fig. 1). 
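
The counting in this passage can be checked with a short sketch. It enumerates the 9 action rules and 64 social norms and reproduces the reputations that CD and CP earn under stern-judging and shunning. The names (`ACTION_RULES`, `make_norm`, `earned_reputations`) are illustrative and not taken from Ohtsuki et al. The encoding of shunning assumes that every scenario other than the two named in the text earns a bad reputation, which the text does not state explicitly.

```python
from itertools import product

ACTIONS = ("C", "D", "P")        # cooperate, defect, punish
REPUTATIONS = ("good", "bad")    # reputation of the person needing help

# An action rule maps the recipient's reputation to an action: 3**2 = 9 rules.
ACTION_RULES = [dict(zip(REPUTATIONS, acts)) for acts in product(ACTIONS, repeat=2)]

# A social norm maps each (action, recipient reputation) scenario to the
# reputation the actor earns: 2**6 = 64 norms.
SCENARIOS = list(product(ACTIONS, REPUTATIONS))
SOCIAL_NORMS = [dict(zip(SCENARIOS, verdicts))
                for verdicts in product(REPUTATIONS, repeat=len(SCENARIOS))]


def make_norm(good_scenarios):
    """Build a norm that awards 'good' only in the listed scenarios."""
    return {s: ("good" if s in good_scenarios else "bad") for s in SCENARIOS}


# Stern-judging follows the text; the 'bad in all other cases' part of
# shunning is an assumption made for this sketch.
STERN_JUDGING = make_norm({("C", "good"), ("D", "bad")})
SHUNNING = make_norm({("C", "good"), ("P", "bad")})

CD = {"good": "C", "bad": "D"}   # cooperate with good, defect with bad
CP = {"good": "C", "bad": "P"}   # cooperate with good, punish bad


def earned_reputations(rule, norm):
    """Reputation the actor earns when facing a good and a bad recipient."""
    return {rep: norm[(rule[rep], rep)] for rep in REPUTATIONS}


print(len(ACTION_RULES), len(SOCIAL_NORMS))
print("CD, stern-judging:", earned_reputations(CD, STERN_JUDGING))
print("CP, stern-judging:", earned_reputations(CP, STERN_JUDGING))
print("CP, shunning:     ", earned_reputations(CP, SHUNNING))
```

The sketch covers only the bookkeeping of rules and norms. Testing evolutionary stability would also need pay-offs and error rates, which are not given here.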

Ohtsuki et al. find that the two action rules 
that induce cooperation and resist invasion 
are those described above — CD under stern- 
judging and CP under shunning. Nonetheless, 
the average pay-off is lower if the action rule 
uses costly punishment, while the stability 
conditions are less restrictive. 

However, which parameters determine which 
rule is most efficient in the sense of leading to 
the highest average pay-off at equilibrium? It 
turns out that a crucial one is the accuracy of 
assigning the correct reputation to everybody. 
If this accuracy is too low then only a DD 
action rule is efficient, under which nobody 
cooperates. If the accuracy is high enough, then 
CD can be efficient. For intermediate values of 
the accuracy, there is a small window in which 
CP can be efficient, as reflected in the title 
of the paper: “Indirect reciprocity provides
only a narrow margin of efficiency for costly
punishment”.

Figure 1 | Action rules, social norms and the story of Alice and Bob. Bob meets Alice and learns whether
Alice has a good or a bad reputation. On the basis of that, Bob may either help (cooperate), refuse help
(defect) or refuse help and, in addition, decrease Alice’s wealth (punish). How Bob reacts is specified
in his action rule. Bob’s own reputation depends on how society’s social norm evaluates Bob’s reaction
to Alice’s reputation. The social norm ‘stern-judging’ evaluates cooperation with good as good and
punishment of bad as bad; the social norm ‘shunning’ evaluates cooperation with good as good and
punishment of bad as good. Each social norm specifies which reputation to assign for each of the
6 possible scenarios (3 actions of Bob for 2 reputations of Alice). This leads to 2^6 = 64 social norms.
[Figure 1 panel labels: Alice's reputation; Social norm]

In a further step in their modelling, Ohtsuki
et al. dropped the assumption that all good
or bad reputations are publicly known, and
allowed individual knowledge of reputa-
tions. They found that the stability of both
CD and CP is lost when there is the smallest
error in distinguishing between good and bad.
When individuals start to communicate with
each other and adjust their assessments of
everybody’s reputation, the CP action rule
can be stably maintained. Then, when repu-
tations become even more publicly agreed
upon through more efficient gossip, both CD
and CP are stably maintained under their
corresponding social norms. Experimental
studies in indirect reciprocity have shown
that gossip can indeed serve as a surrogate 
for direct observation. But it will take further 
empirical research to find out whether gossip 
is efficient enough to re-establish both CD 
and CP. 

The next question addressed by Ohtsuki 
et al. was which kind of society someone might 
prefer to live in. To this end, they simulated 
one society with CD under stern-judging and 
another with CP under shunning. When indi- 
viduals can choose freely between them, the 
CD society — that with the higher expected 
pay-off — is preferred. Thus, the CP rule 
loses to CD when people can choose between 
societies with different norms. 

This last result can be compared with our 
own experimental work with human sub-
jects. We found that when individuals had
the choice between a CD-only society and one 
with both CD and CP, they ultimately preferred 
the latter. Compared with a CP-only control, 
punishing acts were largely reduced in the 
CD plus CP society but were concentrated on 
the most uncooperative players, rendering 
them more cooperative. Our experimental
societies in which CD and CP coexisted 
were more efficient than those with only CP, 
suggesting that they have a more complex 
action rule: respond to good with cooper- 
ate, to bad with defect, and to very bad 
with punish. Such a possibility sets a challenge 
for theorists. 

Finally, given that Ohtsuki et al. show that 
the social norm of a society determines which
action will prevail, another task is to uncover the 
social norms of real societies and analyse which 
action rule to expect. Ohtsuki et al. assume that 
all social norms are equally likely. However, the 
more information a norm requires in order to 
develop, the more susceptible it is to errors 
and the more costly is the information acqui-
sition. Such restrictions may challenge any
social norm that otherwise dominates: for
example, in an experimental study, the
subjects had a majority social norm similar
to shunning that was ‘low observation’ and
‘memory demanding’.

Ultimately, study of the joint evolution of
social norms and action rules under natu-
ral constraints is the goal for the future.
Ohtsuki et al. have prepared the ground for
that endeavour.
As Charles Darwin showed so
convincingly, the fauna of islands 
provide excellent subjects for 
investigating evolution. The 
creature on the left, Anolis sagrei,
is a case in point. Jonathan Losos
and colleagues studied this lizard 
in experimental work, carried out in 
2003, that involved the introduction 
of a predator of this species onto 
six islands in the Bahamas. Six 
other islands acted as controls. 
Losos and colleagues’ aim was
to test the hypothesis that,
when organisms experience new
environments, behavioural change 
prevents the operation of natural 
selection (they concluded that in 
this case it did not). 

Readers can find out for 
themselves what the authors did at 
www.nature.com/evolutiongems. 
The paper concerned is one of 
“Fifteen evolutionary gems: A 
resource for those wishing to spread 
awareness of evolution by natural
selection”, which from today will
be available as a collection on the 
Nature website. The “gems” are all 
papers, published in Nature over the 
past decade or so, that demonstrate 
the enduring explanatory power of 
Darwinian natural selection. 
Examples are drawn from the fossil 
record, from extant organisms in 
natural and experimental habitats, 
and from molecular studies. The 
other images here — of (clockwise 
from top left) water fleas, Daphnia 
magna; fledglings of the great
tit, Parus major; sticklebacks,
Gasterosteus aculeatus; and a garter 
snake of the species Thamnophis 
sirtalis — provide a taster of the 
subjects of other papers in the 
collection. 

The papers are free to download 
and disseminate, and each is 
accompanied by a brief editorial 
introduction to its context and 
significance. 

See also Editorial, page 8. 


K. TAYLOR/NATUREPL.COM; L. STOCKER PHOTOLIBRARY.COM; K. TAYLOR/NATUREPL.COM; R. PLANCK/NHPA 


P. LLEWELLYN/FLPA 


NEWS & VIEWS 


NATURE|Vol 457/1 January 2009 


18 years of science with the Hubble Space Telescope


Julianne J. Dalcanton


After several decades of planning, the Hubble Space Telescope (HST) was launched in 1990 as the first of NASA's Great
Observatories. After a rocky start arising from an error in the fabrication of its main mirror, it went on to change forever many
fields of astronomy, and to capture the public's imagination with its images. An ongoing programme of servicing missions has
kept the telescope on the cutting edge of astronomical research. Here I review the advances made possible by the HST over
the past 18 years.


The impact of the HST on the public imagination is large, yet
many are surprised to learn that the HST is rather modestly
sized in comparison with modern telescopes (Fig. 1). Its
mirror is only a generous arm span across, with less than
~1/15th the light-gathering area of the largest telescopes available on 
the ground. Its instruments are likewise far from state of the art, 
containing detectors whose technologies are sometimes more than 
a decade old. What then accounts for the HST’s tremendous scientific 
and public impact? 

Figure 1 | The Hubble Space Telescope, while docked with the Space
Shuttle Endeavour during servicing. The new instruments installed during
such missions have kept HST at the cutting edge of astronomy. Image
courtesy of NASA.

The HST’s successes can be attributed to the same three key factors
influencing real estate: location, location, location. Raising the HST
above most of the Earth’s atmosphere has allowed it to escape a host
of problems that limit telescopes on the ground. The first of the
resulting improvements is the dramatic increase in a telescope’s
resolution—the smallest angular separation that can be reliably 
detected (Fig. 2). Images taken with the HST can distinguish features 
that are separated by less than a tenth of an arc second (comparable to 
the angle spanned by half a millimetre when seen from a kilometre 
away). Comparable images made from the ground are typically 
blurred by a factor of ten relative to those made with the HST. 
Moreover, because the HST’s view of the Universe is unperturbed 
by the turbulent, chaotic atmosphere, the images and spectra taken 
by the telescope are stable and reproducible. This stability allows for 
an unprecedented level of precision when measuring the brightness 
and structure of astrophysical objects. 
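
The comparison in parentheses above can be verified with a few lines of arithmetic. The sketch below is a minimal check using the small-angle approximation. The function name `subtended_size` is illustrative, and the 1-arc-second ground-based figure is simply ten times the HST value quoted in the text.

```python
import math

ARCSEC_IN_RADIANS = math.pi / (180.0 * 3600.0)


def subtended_size(angle_arcsec, distance_m):
    """Length in metres that spans angle_arcsec at distance_m (small-angle approximation)."""
    return angle_arcsec * ARCSEC_IN_RADIANS * distance_m


# Size resolved at a distance of one kilometre, converted to millimetres.
hst_mm = subtended_size(0.1, 1000.0) * 1000.0      # HST: a tenth of an arc second
ground_mm = subtended_size(1.0, 1000.0) * 1000.0   # blurred by a factor of ten

print(f"HST: {hst_mm:.3f} mm, ground: {ground_mm:.3f} mm")
```

A tenth of an arc second at one kilometre comes out at just under half a millimetre, consistent with the comparison in the text.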

The second, less publicized benefit of the HST’s location is the 
darkness of space-based imaging. From the ground, the night sky is 
not actually all that dark. Atoms in the upper atmosphere absorb 
energy from sunlight during the day, and then re-radiate that energy 
as light during the night. In much the same way that you can see more 
stars from a dark campground than from a bright city street, from 
space the HST can measure astronomical features fainter than those 
ground-based telescopes can detect against the glowing background 
of the night sky. This increase in contrast is especially pronounced at 
ultraviolet wavelengths, where photons are otherwise absorbed by the 
Earth’s atmosphere and are thus inaccessible from the ground. Access 
to this portion of the electromagnetic spectrum is critical for tracing 
the abundance of individual elements in astrophysical gases, and for 
interpreting observations of galaxies at large distances. 

In this article I summarize some of the many areas where the HST 
has dramatically changed our understanding of the Universe. These 
include tracing the structure and evolution of stars from their births 
through to their deaths, demonstrating the pervasiveness of black 
holes and their links to galaxy formation, tracking the evolution of 
galaxies over billions of years, and testing fundamental models of the 
expansion of the Universe. 


Distance scale 


One of the most consistently vexing issues facing astronomers is 
measurement of the distances to astronomical objects. It perhaps 
seems odd that something so basic, and so easily judged on Earth, 
can be so difficult in an astronomical context. However, in astro- 
nomy we observe systems whose energy outputs and distances can 
differ by a dozen orders of magnitude. We therefore often have little a
priori knowledge with which to estimate the basic size and energy
scales of a given system. Are we observing a faint object that happens
to be close, or a bright object that is much farther away? Moreover,
even when we can make reasonable assumptions about an object’s
distance, we frequently lack the precision needed for critical cos-
mological tests.

Ground: Subaru (8 m); Space: HST (2.4 m)

Figure 2 | Images taken from space have superb spatial resolution. Images
of a distant galaxy taken from the ground (with the Subaru 8-m telescope
(National Astronomical Observatory of Japan) on Mauna Kea) are blurred

Over the past century, astronomers have built up an elaborate 
distance ‘ladder’, which uses the distances to the nearest stars to infer 
the distances to more distant objects. We start the ladder with stars 
that are sufficiently close that their position on the sky shifts during 
the Earth’s orbit around the Sun. This parallax effect (which you can 
see by alternately shutting your left and right eyes) can be used to 
calculate a distance on the basis of the principles of geometry only, 
making it robust against systematic errors. The only limit to the 
method is the precision with which we can measure slight angular 
shifts. With higher resolution, parallax measurements can be pushed 
to much larger distances. 
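
The geometry behind the parallax method can be written out as a short sketch. It treats the Sun, the Earth and the star as a right-angled triangle whose short side is one astronomical unit. The function names and the example values (a 2 milli-arc-second parallax measured to 0.2 milli-arc-seconds) are hypothetical and chosen only for illustration.

```python
import math

AU_M = 1.495978707e11                      # astronomical unit in metres
ARCSEC_RAD = math.pi / (180.0 * 3600.0)    # one arc second in radians
PARSEC_M = AU_M / math.tan(ARCSEC_RAD)     # distance at which 1 au subtends 1 arc second


def parallax_distance_pc(parallax_arcsec):
    """Distance in parsecs from the Sun-Earth-star triangle (pure geometry)."""
    distance_m = AU_M / math.tan(parallax_arcsec * ARCSEC_RAD)
    return distance_m / PARSEC_M


def fractional_distance_error(parallax_arcsec, sigma_arcsec):
    """First-order error propagation: sigma_d / d = sigma_p / p."""
    return sigma_arcsec / parallax_arcsec


# Hypothetical example: parallax of 2 mas measured to 0.2 mas.
d_pc = parallax_distance_pc(0.002)
frac = fractional_distance_error(0.002, 0.0002)
print(f"distance = {d_pc:.1f} pc, fractional uncertainty = {frac:.0%}")
```

Because the fractional distance error equals the fractional parallax error, halving the angular uncertainty doubles the distance that can be reached at a given accuracy. This is the sense in which higher resolution pushes the method outwards.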

The HST excels at parallax measurements owing to both its superb 
resolution and its Fine Guidance Sensor (FGS). Although the FGS is 
primarily responsible for keeping the telescope oriented precisely 
during scientific observations, it can also be used to make milli- 
arc-second measurements of the angular positions of individual 
stars. Moreover, the FGS is quite sensitive and can measure accurate 
parallaxes for stars that are much fainter than those that could be 
studied with the earlier parallax mission satellite HIPPARCOS (the 
High Precision Parallax Collecting Satellite). By measuring parallaxes 
of fainter stars, the HST can probe to larger distances, and sample a 
larger volume of the Galaxy. This allows the HST to calibrate the 
luminosities of truly rare stars which are difficult to find nearby, 
but that are crucial for establishing more distant rungs on the dis- 
tance ladder. 

in comparison with images of the same galaxy made with the HST. Images
courtesy of NASA.

Among the most important such stars are Cepheids, a class
of pulsating stars that are up to thousands of times more luminous
than the Sun, and can thus be identified in galaxies far outside the
Milky Way owing to their brightnesses and characteristic periodic
variations in luminosity. A Cepheid’s luminosity is tightly correlated 
with the duration of its pulsation cycle. Once this correlation is 
calibrated, one can then deduce the distance to a Cepheid by mea- 
suring the duration of its pulsation, inferring its luminosity and then 
comparing the luminosity with the apparent brightness. The accu- 
racy of the derived distance is typically limited by the accuracy of the 
initial period–luminosity relationship. The FGS on the HST has
used direct parallax measurements to calibrate this relationship, greatly
reducing the uncertainties in Cepheid-based distances. Although this
period–luminosity relationship may potentially vary with the metal-
licity of the galaxy, with appropriate corrections it can reliably be
used for many massive spiral galaxies. Previously, climbing from 
reliable parallax measurements of nearby stars to Cepheid-based dis- 
tances to spiral galaxies required many complicated steps up the 
distance ladder. With the new HST results, astronomers can make 
this step in a single jump. 
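
The chain of inference described above (pulsation period, then luminosity, then distance) can be sketched as follows. The functional form of the period–luminosity relation and the distance modulus are standard. The slope and zero point used here are placeholder values for illustration, not the calibrated coefficients from the HST parallax work, and the sketch ignores dust extinction and metallicity corrections.

```python
import math


def absolute_magnitude(period_days, slope, zero_point):
    """Period-luminosity relation of the form M = slope * (log10(P) - 1) + zero_point."""
    return slope * (math.log10(period_days) - 1.0) + zero_point


def distance_pc(apparent_mag, absolute_mag):
    """Invert the distance modulus m - M = 5 * log10(d / 10 pc)."""
    return 10.0 ** ((apparent_mag - absolute_mag + 5.0) / 5.0)


# Placeholder coefficients (hypothetical, for illustration only).
SLOPE, ZERO_POINT = -2.4, -4.0

# Hypothetical Cepheid: 10-day period, mean apparent magnitude 21.
M = absolute_magnitude(10.0, SLOPE, ZERO_POINT)
d = distance_pc(21.0, M)
print(f"M = {M:.2f}, distance = {d / 1e6:.2f} Mpc")
```

Any error in the zero point shifts every inferred distance by the same factor, which is why a direct parallax calibration of the relationship matters so much.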

Identifying Cepheids in nearby galaxies requires the HST’s reso- 
lution. Although Cepheids are luminous, they can appear to be 
blended together with the light from many other stars in images from 
ground-based telescopes, making it impossible to identify the lumin- 
osity fluctuations or the brightness of the one star in the cluster that 
happens to be a Cepheid. In contrast, the high angular resolution of
the HST allows Cepheids to be separated from their neighbouring
stars out to vastly larger distances, increasing the number of galaxies
with potentially reliable distances by a factor of close to 1,000.

Figure 3 | A montage showing the life cycle of stars. a, The Orion nebula is
a dense cloud of gas within which new stars are forming. The inset shows a
close-up of a young star surrounded by a dusty disk that may eventually
form planets. b, Young blue stars surrounded by leftover natal gas in the
Large Magellanic Cloud, a nearby dwarf galaxy. c, The evolving star V838
Monocerotis, whose past flashes have illuminated material that was ejected
in an early stellar wind. d, A planetary nebula forms as a dying star sheds its
outer layers, leaving behind a fading white dwarf at its centre. e, A globular
cluster containing many old stars. The faintest stars in the cluster are white
dwarfs. Images courtesy of NASA.

Establishing Cepheid-based distances to nearby galaxies was one of 
the Key Projects identified before the HST’s launch. The Cepheid Key 
Project has since used hundreds of orbits to analyse Cepheids in 36 
galaxies, all of which were used to calibrate even more distant rungs of 
the ladder (such as supernovae and the Tully–Fisher relationship).
These Cepheid distances have become the bedrock foundation of
our understanding of the size scale of the distant Universe. Distances
to galaxies can be inferred from the ‘Hubble diagram’, a linear
relationship between the distance to a galaxy and the speed at which 
it appears to be receding from us. The slope of this relationship is 
known as the Hubble constant, and once its value is known it is 
possible to measure recessional velocities to galaxies (which is 
straightforward) and use them to infer distances (which is difficult). 
The HST’s observations of Cepheids have been essential in establish- 
ing the distance portion of the Hubble diagram, and thus have laid 
the foundations for modern measurements of the Hubble constant. 
Before the HST, the Hubble constant was known only to within a 
factor of two, but with the new Cepheid observations the uncertainty 
in the distance scale of the Universe has dropped to roughly 10%. 
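
The use of the Hubble diagram described above amounts to dividing a recessional velocity by the Hubble constant. The sketch below shows how an uncertainty in the constant carries over directly to the inferred distance. The value of the Hubble constant and the example velocity are assumed round numbers chosen for illustration, not measurements quoted in this article.

```python
def hubble_distance_mpc(velocity_km_s, h0_km_s_mpc):
    """Distance in Mpc from the linear Hubble relation v = H0 * d."""
    return velocity_km_s / h0_km_s_mpc


H0 = 70.0            # assumed Hubble constant in km/s/Mpc (illustrative)
VELOCITY = 7000.0    # hypothetical recessional velocity in km/s

best = hubble_distance_mpc(VELOCITY, H0)

# A 10% uncertainty in H0 gives a comparable spread in distance.
near_10 = hubble_distance_mpc(VELOCITY, H0 * 1.1)
far_10 = hubble_distance_mpc(VELOCITY, H0 * 0.9)

# A factor-of-two range in H0 (here 50 to 100, also illustrative)
# gives a factor-of-two range in distance.
near_2x = hubble_distance_mpc(VELOCITY, 100.0)
far_2x = hubble_distance_mpc(VELOCITY, 50.0)

print(f"best estimate: {best:.0f} Mpc")
print(f"10% uncertainty: {near_10:.0f} to {far_10:.0f} Mpc")
print(f"factor of two:   {near_2x:.0f} to {far_2x:.0f} Mpc")
```

The linear relation holds only for relatively nearby galaxies. At large distances the conversion depends on the cosmological model, which this sketch does not attempt to cover.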


The life cycle of stars 


Although stars are frequently assumed to be constant and unchan- 
ging features of the firmament, they are in fact evolving dynamic 
systems. New stars condense out of gaseous nebulae, and old stars 
evolve through planetary nebulae and supernovae into white dwarfs, 
neutron stars and black holes. These processes—star formation and 
evolution—are critical to understanding many features of the 
Universe, including the evolution of galaxies, the dispersal of chem- 
ical elements and the distribution and energetics of gas. 

Some of the HST’s most lasting (and beautiful) contributions to 
stellar astronomy have been its studies of star-forming regions like 
the Orion nebula (Fig. 3). In these regions, luminous massive stars 
ionize the gas cloud from which they coalesced, causing the cloud to 
glow brightly in various emission lines. The HST’s earliest observa-
tions of the Orion nebula revealed that it was peppered with a
remarkable population of young stars surrounded by dense disks of
gas and dust. These disks are undoubtedly remnants of the late accre-
tion phase during which the stars condensed. Although the presence
of such disks had been inferred from theory and from observations
with the Very Large Array, the HST’s superior image resolution
revealed the first true pictures of the disks’ structures and physical 
properties. 

The observed ‘proto-planetary disks’ are exactly the type of system 
that astronomers expect will evolve into planetary systems like those 
hosted by our own Sun. The ubiquity of the disks seen in the HST 
images gave strong hints that our Solar System is unlikely to be 
unique, and recent studies have indeed found direct evidence for 
the ‘extrasolar’ planetary systems that are expected to evolve from 
the disks seen in the images. The HST is studying intermediate stages 
of this process directly, by mapping the detailed structures of disks 
that have lost their gas but are still dominated by dust that will 
eventually be either incorporated into planets or dispersed". 
Although individual cases had been observed from the ground, the 
HST has proven the ubiquity of such systems. Moreover, with its 
superb resolution, the HST is uncovering evidence of structure in 
these debris disks, indicating that they have been shaped by the grav- 
itational influence of planets orbiting the young, distant host stars''. 

In addition to revealing the properties of young stars, the HST 
continues to help us understand the older stars that dominate the 
stellar content of our Galaxy. Although many of these stars are rela- 
tively ordinary stars like the Sun, the majority are much fainter and 
are of lower mass. At these low masses, the stars begin to have more in 
common with planets, hosting complex atmospheres and sometimes 
lacking a central energy source (brown dwarfs). Luckily, many of 
these stars are also locked into gravitationally bound orbiting pairs, 
known as binaries, which serve as laboratories for measuring the 
properties of stars. The orbits of binary stars can be used to derive 
their masses, and, because the stars in the binary can be assumed to 
have formed simultaneously, it can also be assumed that they have a 
common age and metallicity. Binaries are thus particularly useful 
when studying poorly understood types of star. Unfortunately, most 
stars are far enough away that the images of the two stars in the binary 
cannot be separated in a typical image, making it difficult to measure 
the orbits, luminosities, colours and spectra of the two components. 
This step is particularly difficult when one of the stars is much fainter 
than the other, as is common for low-mass stars, which are among the 
most complex and least well-understood of normal stars. However, 
with the HST’s spatial resolution, image stability and low back- 
ground, the properties of binary stars can easily be measured. The 
HST has thus powered critical advances in our understanding of low- 
mass stars and brown dwarfs, by identifying the earliest brown dwarf 
binaries'””’, characterizing the relationship between mass and lumin- 
osity for low-mass stars'*, making dynamical mass measurements for 
field brown dwarfs'*’®, finding unusual T dwarf binaries'” and con- 
straining the rate of binarity at the lowest stellar masses'*"’. It has also 
been an essential tool in simply quantifying the numbers of the lowest 
mass stars and failed brown dwarfs, both in young”””’ and old’*”’ 
stellar clusters. 
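
As a rough illustration of how an orbit yields masses, the sketch below applies Kepler's third law in solar units: the total mass in solar masses equals the cube of the semi-major axis in astronomical units divided by the square of the period in years. The function names and the example binary are hypothetical and are not taken from any HST measurement. The conversion from angular to physical separation assumes a known distance and ignores the projection of the orbit on the sky.

```python
def semi_major_axis_au(theta_arcsec: float, distance_pc: float) -> float:
    """Small-angle conversion: 1 arcsec at 1 pc corresponds to 1 au."""
    return theta_arcsec * distance_pc


def total_mass_msun(a_au: float, period_yr: float) -> float:
    """Kepler's third law in solar units: M1 + M2 = a^3 / P^2."""
    return a_au ** 3 / period_yr ** 2


def component_masses_msun(a_au: float, period_yr: float, a1_over_a2: float):
    """Split the total mass using the barycentric orbit sizes (M1/M2 = a2/a1)."""
    total = total_mass_msun(a_au, period_yr)
    m1 = total / (1.0 + a1_over_a2)
    m2 = total - m1
    return m1, m2


# Hypothetical binary: 0.2 arcsec separation at 20 pc, 10-year period,
# with the primary's orbit one-third the size of the secondary's.
a = semi_major_axis_au(0.2, 20.0)
m1, m2 = component_masses_msun(a, 10.0, a1_over_a2=1.0 / 3.0)
print(f"a = {a:.1f} au, M1 = {m1:.2f} Msun, M2 = {m2:.2f} Msun")
```

The strong dependence on the separation (a cubed) is one reason why resolving the two stars cleanly matters so much for the derived masses.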

The HST is yielding surprises even for more ordinary stars. For 
most of the past century, astronomers believed that the simplest 
collections of stars could be found in globular clusters, which are 
dense, gravitationally bound groups of ~10⁴-10⁶ stars. The stars in 
globular clusters were thought to have formed at exactly the same 
time, from a single cloud of gas, giving them identical ages and 
chemical compositions. Unfortunately, our ability to measure the 
properties of individual stars in globular clusters is compromised 
by the high stellar densities within the clusters, and only with the 
HST can individual stars be separated from one another in an image. 
Recently, the exceptional photometric stability and high resolution of 
the HST have revealed that some globular clusters have far more 
complex properties than were previously thought possible: some host 
stars that formed in multiple events and/or with distinct chemical 
compositions”. Although this work is new, it has already begun 
overturning half a century of thought on the nature of these suppo- 
sedly ‘simple’ systems. 


The death of stars 


After their life of quiescent hydrogen burning is over, stars undergo 
dramatic changes. When the primary energy source at the centre of a 
star becomes exhausted, the star is forced to undergo dramatic evolu- 
tion. During this evolution, the star expands drastically, sheds much 
of its mass and either explodes or fades away, leaving behind a bizarre 
stellar remnant (such as a white dwarf, neutron star or black hole). 

These late phases of stellar evolution are surprisingly brief on 
typical astronomical timescales. Astronomers have used the HST to 
track the final stages of mass loss in rapidly evolving stars like η 
Carinae, using time-resolved measurements to watch the expansion 
of the stellar ejecta in real time****. They have also revealed the ghosts 
of past mass ejection from evolving stars like V838 Monocerotis (ref. 
29) or from SN 1987A (refs 30-32) using ‘light echoes’ that have 
bounced off expanding shells of matter ejected farther and farther 
into the recent past. Equally striking are the HST images of planetary 
nebulae*’, formed after most mass loss is complete in lower mass 
stars. Like the images of η Carinae and V838 Monocerotis, the planetary 
nebulae show a rich level of complexity****, some of which 
varies over year-long timescales*”**. Most importantly, in almost 
none of these cases does the distribution of ejected mass look spher- 
ical, in spite of the nearly spherical shape of the initial star. This 
asymmetry suggests that the process of mass loss in stars is complex, 
and involves ‘shaping’ of the stellar winds by rotation, magnetic fields 
and/or orbiting stellar companions. 

Similar evidence that even apparently symmetric stars have asymmetric 
deaths comes from HST identifications of neutron stars, 
which are incredibly dense stellar remnants left behind after the 
supernova explosion of a massive star. Although they begin their lives 
as stars larger than the Sun, neutron stars are only a few kilometres 
across, making them very faint at optical wavelengths. The optical 
emission provides a critical means of constraining their temperatures 
and sizes, which in turn tells a great deal about the strange nuclear 
matter within the stars. The HST has been able to identify the optical 
counterparts of several neutron stars, and through repeated observations 
has found that some are hurtling through the galaxy at a surprisingly 
high speed of more than 100 km s⁻¹ (ref. 39). These 
observations suggest that the supernova somehow gives the dense 
stellar remnant a tremendous ‘kick’, shooting it out of the supernova, 
despite the initial star being essentially spherical. 

The HST is also shedding light on the connection between supernovae 
from dying stars and mysterious γ-ray bursts. At the HST’s 
launch, it was not known if these fast, intense bursts of γ-rays 
were from sources inside our galaxy or far outside it. Subsequent 
satellites showed that they were occurring outside our galaxy, and 
HST observations of the fading optical transients further localized 
some of the bursts to individual galaxies, in regions containing 
young, massive stars***. The HST thus convincingly showed that these 
highly energetic events are tied to the very last phases of the lives of 
massive stars. The theoretical models resulting from these observa- 
tions are suggesting that such ‘hypernovae’ may be essential for estab- 
lishing the unusual elemental abundance patterns seen in the oldest 
stars. 


Black holes 


The same explosions that lead to supernovae and γ-ray bursts can also 
lead to the formation of remnant black holes. These enigmatic objects 
form when mass collapses to within an extremely small radius, 
known as the Schwarzschild radius. For a star like the Sun to turn 
into a black hole, it would need to shrink by a factor of more than 
200,000, down to a radius of less than 3 km. When a massive star runs 
out of fuel it collapses to within its Schwarzschild radius, becoming 
sufficiently dense that even light cannot escape from the tremendous 
gravitational field; hence the name ‘black hole’. In spite of their 
name, black holes can be responsible for some of the most luminous 
astrophysical phenomena, owing to the high temperatures of the 
material that they sometimes accrete. However, in many cases black 
holes are truly dark, and their presence can only be inferred from 
their gravitational influence on the material around them. 
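
The figures quoted above for the Sun can be checked with the standard expression for the Schwarzschild radius of a non-rotating mass, r_s = 2GM/c². The sketch below is only an order-of-magnitude check; the constants are rounded textbook values and the function name is ours.

```python
G = 6.674e-11      # gravitational constant, m^3 kg^-1 s^-2 (rounded)
C = 2.998e8        # speed of light, m s^-1 (rounded)
M_SUN = 1.989e30   # solar mass, kg (rounded)
R_SUN = 6.957e8    # solar radius, m (nominal value)


def schwarzschild_radius_m(mass_kg: float) -> float:
    """Schwarzschild radius r_s = 2GM/c^2, in metres."""
    return 2.0 * G * mass_kg / C ** 2


r_s = schwarzschild_radius_m(M_SUN)
print(f"r_s = {r_s / 1e3:.2f} km, shrink factor = {R_SUN / r_s:.0f}")
```

Because r_s scales linearly with mass, a black hole of a million solar masses would still be only a few million kilometres in radius, which is tiny on galactic scales.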

Key to finding a dark black hole is to look for evidence of a large 
amount of mass being in a very small space. We can deduce that mass 
is present by the motions produced by the gravitational pull of the 
black hole. When these motions are very fast, and show rapid changes 
in velocity and/or direction over a very small distance, then we can 
infer the presence of a black hole. However, we need measurements of 
velocities that probe exceptionally small physical distances to ensure 
that the measured mass is in a small enough volume that it can be 
explained only by a black hole. Finding black holes in even the nearest 
galaxies thus requires the exceptional angular resolution of the HST. 
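
To make the argument concrete, the sketch below estimates the mass enclosed within a radius r from a measured orbital speed v, using the simple circular-orbit relation M ≈ v²r/G, and converts an angular resolution into a physical scale at a given distance. The resolution (0.1 arcsec), distance (16 Mpc) and speed (500 km s⁻¹) are assumed round numbers chosen for illustration, not values from the studies cited here, and real dynamical modelling is considerably more involved.

```python
import math

G = 6.674e-11                            # m^3 kg^-1 s^-2 (rounded)
PC_M = 3.0857e16                         # metres per parsec
M_SUN = 1.989e30                         # kg (rounded)
ARCSEC_RAD = math.pi / (180.0 * 3600.0)  # radians per arcsecond


def physical_scale_pc(theta_arcsec: float, distance_pc: float) -> float:
    """Physical size subtended by a small angle at a given distance."""
    return theta_arcsec * ARCSEC_RAD * distance_pc


def enclosed_mass_msun(v_km_s: float, r_pc: float) -> float:
    """Circular-orbit estimate M = v^2 r / G, in solar masses."""
    v = v_km_s * 1.0e3
    r = r_pc * PC_M
    return v ** 2 * r / G / M_SUN


# Assumed illustrative values: 0.1 arcsec resolution, galaxy at 16 Mpc,
# stars or gas moving at 500 km/s at the resolution limit.
r = physical_scale_pc(0.1, 16.0e6)
mass = enclosed_mass_msun(500.0, r)
print(f"resolved radius = {r:.1f} pc, enclosed mass = {mass:.2e} Msun")
```

With ground-based seeing several times coarser, the same speed would constrain the mass only within a correspondingly larger radius, where a dense cluster of ordinary stars becomes harder to rule out.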

Over the past several decades, the HST has revolutionized our view 
of black holes and the role they have in galaxy formation. At the 
launch of the HST, it was generally assumed that supermassive black 
holes were needed to explain the unusually high luminosities and 
peculiar spectra of distant quasi-stellar objects (QSOs) and lower 
luminosity nearby active galactic nuclei. However, there was then 
no firm evidence that QSOs were always hosted by galaxies, or that 
all galaxies currently hosted remnant black holes left over from an 
early epoch when QSOs were common. Some ground-based obser- 
vations strongly suggested that a handful of the nearest galaxies might 
host dense, massive objects at their centres, but the identification of 
these as black holes was far from confirmed”. 


Since its launch, the HST has been engaged in a systematic obser- 
vational assault on these issues. One of the earliest HST Key Projects 
was responsible for establishing that all QSOs (which were thought to be 
powered by black holes) were indeed hosted by galaxies**. Later, 
dynamical modelling of HST spectra in nearby galaxies confirmed 
the presence of black holes through their gravitational influence on 
the surrounding stars**“*. These studies led to dozens of robust mea- 
surements of central black hole masses, which then revealed an extre- 
mely tight correlation between black hole mass and the mass of the 
galactic bulge (the smooth, rounded stellar component at the galaxy’s 
centre)**”°; this relationship had been hinted at in previous ground- 
based work, but with much larger uncertainty and scatter. Moreover, 
the modelling showed that even dormant galaxies without luminous 
active nuclei (a sign of accretion onto a black hole) also hosted 
black holes, confirming that central, massive black holes are a generic 
feature of galaxy formation. 

This surprising result has yet to be understood. The black holes that 
form during the deaths of massive stars have relatively small masses, 
and are unlikely to be more than 20 times more massive than the Sun. 
By contrast, black holes at the centres of galaxies are millions or bil- 
lions of times more massive than the Sun, leaving open questions 
about how the supermassive black holes formed, how they migrated 
to the centre and how their masses ‘knew’ about the masses of their 
host galaxies. HST observations are beginning to unravel some of 
these questions, hinting that intermediate, ~10⁴-solar-mass, black 
holes might exist in some globular clusters*'** that could later accrete 
into the galactic centre, but that some bulgeless galaxies (like our 
neighbour M33) have no evidence for a central black hole at all°’. 
The HST’s resolution has also been essential to the observation of 
structures that may be either analogues or precursors of central black 
holes. These ‘nuclear clusters’ are dense stellar knots found at the 
centres of some galaxies. The masses of nuclear clusters are also cor- 
related with galaxy properties”, raising the suspicion that they are 
potentially visible manifestations of the processes that may lead to 
supermassive black holes. The HST has shown that at least some of 
these clusters are disk-like, suggesting that gas accretion may have a 
role in the growth of massive structures at the centres of galaxies”’. 

Given the presence of massive black holes at the centres of most 
galaxies, it might be expected that the black holes have helped to 
shape the distribution of stars around them. Such an effect should 
be strongest in massive galaxies, owing to the more massive black 
holes within. Indeed, some of the earliest investigations with the HST 
revealed strong differences between the central stellar distributions of 
massive and low-mass galaxies**’; these differences, although 
recently questioned”, have been confirmed in many subsequent 
studies*'®’. Galaxies were found to have a sharp bimodality at their 
centres, with the more massive galaxies having flat ‘cores’ in their 
light distributions and lower mass galaxies having a bright central 
cusp. Some models predict that these flat cores could be produced by 
pairs of central black holes, the orbits of which can fling stars out of 
the galactic centres™ either before or during merging. Although the 
observed trend can be interpreted as a response to the presence of 
central black holes, there are many other properties of the galaxies 
that similarly correlate with the central light profile, such as the 
overall shape of the galaxy (box-like versus disk-like, triaxial versus 
axisymmetric) and the internal kinetics (rotating versus non-rotating). 
These trends suggest that the dichotomy between core and 
cusp galaxies may transcend the properties of the central black holes, 
and may instead be intimately linked with the formation of the galaxy 
as a whole. 


Growth of galaxies 

At the time that the HST was launched, astronomers had long been 
used to seeing the familiar, regular forms of galaxies at the present day. 
However, there were long-standing questions as to how these forms 
came about, and how they may have changed and evolved through 
time. In principle, the question could be answered by looking at distant 
galaxies, which are so far away that light emitted many billions of years 
ago is only now reaching us. Thus, in progressively more distant gal- 
axies we can see earlier and earlier snapshots of the galaxies’ evolution, 
reaching back to a time when the Universe was only a fraction of its 
present age—an effect much like receiving postcards sent from ever 
more distant places, carrying old news from the time they were sent. 
Unfortunately, owing to blurring produced by the Earth’s atmosphere, 
these distant galaxies appeared only as unresolved smudges in images 
taken from the ground. 

With the launch of the HST, astronomers got their first look at the 
shapes and structures of young galaxies, and what they saw was sur- 
prising (Fig. 4). Instead of looking like close analogues of the spiral and 
elliptical galaxies we see today, images of the distant Universe revealed 
a large population of galaxies that looked not unlike insects splattered 
on a windscreen. This population can be seen most dramatically in 
images of the Hubble Deep Fields**® and the later-generation Ultra 
Deep Field®’, made with a more sensitive, larger-format camera. 

Figure 4 | Galaxies near and far. a, Images showing the diverse properties of 
galaxies. b, Unusually shaped distant galaxies in the Hubble Ultra Deep 
Field. Images courtesy of NASA. 

The HST deep field images revealed that young galaxies were built 
out of fragments, through a process that we now understand to be the 
hierarchical build-up of smaller galaxies into ever larger ones. The 
HST images were key in establishing the current galaxy formation 
model, in which galaxy properties are shaped by the continual merger 
and accretion of matter over cosmic timescales’. Parallel work in 
clusters of galaxies showed the influence of merging on larger scales, 
using the combination of HST images and spectroscopy” **. Only 
with the HST could we measure the complicated structures and 
sizes**** of these younger evolving galaxies. 

Also key in establishing this change in model was the huge invest- 
ment made in measuring relative distances to the galaxies in the 
Hubble Deep Field (HDF). Measuring relative distances requires 
time-consuming spectroscopy of faint individual galaxies with which 
one can measure recessional velocities. Although such programmes are 
expensive in terms of limited telescope resources, given the unprece- 
dented allocation of the HST’s resources in producing the HDF, the 
astronomical community responded with an equally unprecedented 
investment of spectroscopic observations of the HDF galaxies***’. 
These spectroscopic observations became key in establishing the link 
between a galaxy’s observed colour and its distance. This ‘photometric 
redshift’ technique had been developed for nearby galaxies**, but only 
with the HDF was it confirmed to be a viable method for measuring 
distances to the most distant galaxies*’. Since the verification of this 
technique in the HDF, there has been an explosion in the use of 
photometric redshifts, allowing major surveys”*’ to skip time- 
consuming spectroscopy when establishing the relative distances of 
galaxies. Photometric redshifts also allow astronomers to estimate 
distances to galaxies that are otherwise too faint for spectroscopic 
observations. Some of the earliest work applying photometric redshifts 
was carried out on the HDF itself’**®, and showed most clearly the assembly 
of galaxies from small lumps into large coherent units, and the accom- 
panying rise in the overall rate of star formation in the Universe”*””. 

In addition to the great strides made through its in situ studies of 
distant young galaxies, the HST has made equal progress using the 
complementary tool of studying the fossil record that galaxy forma- 
tion has left behind. This record is embedded in the individual stars 
that formed during a galaxy’s lifetime. Seen from the ground, these 
billions of stars blur into a single smooth image. However, when 
viewed at the highest spatial resolution, the smooth galaxy image 
breaks into millions of individual points, revealing individual stars, 
much like the pointillism seen in a Seurat painting. The colours and 
brightnesses of these stars encode information about their ages and 
chemical compositions”, and allow us to infer the past history of star 
formation in the galaxy. 

With the accurate measurements provided by the HST’s spatial 
resolution and photometric stability, astronomers have characterized 
the stellar populations of the oldest extended structures in gal- 
axies”'°°, tracing the earliest epochs of a galaxy’s assembly. They 
have also used the HST to map the ebb and flow of star formation 
across the disk of nearby galaxies, tracing the detailed evolution of 
star formation over hundreds of millions of years!”. 


Cosmology 


Some of the most widely recognized successes of the HST have been 
in the field of cosmology. Cosmology attempts to measure and 
understand the most basic large-scale features of the Universe, such 
as its age, composition and size. Constraining these quantities relies 
primarily on astronomical observations, which are among the only 
tools that can access the distances and ages over which the Universe’s 
evolution and structure can be measured. 

The age of the Universe can be inferred in multiple ways. The most 
secure measurements of the age come from the oldest stars we see 
today, whose ages place a secure lower limit on the age of the 
Universe. To derive ages for a population of stars, we ideally want 
to find a cluster of stars that were all born at the same time. The stars 
within the cluster all evolve at different rates, such that the most 
massive stars quickly become supernovae, the intermediate-mass 
stars (like the Sun) evolve into planetary nebulae and leave behind 
a faint remnant white dwarf, and the lowest mass stars last for the age 
of the Universe. The age of the stellar cluster can then be inferred 
from the masses of the stars that are left behind. 

There has been an elegant series of HST measurements using this 
technique, focusing on white dwarfs. After a white dwarf forms, it no 
longer has a source of energy in its core. It therefore cools steadily, 
dropping in temperature and brightness at a rate that can be calculated 
from theoretical models. These models can be compared with the 
observed distribution of brightnesses to infer the age of the white 
dwarf, placing a robust lower limit on the age of the star’°*'”. 
Unfortunately, white dwarfs are sufficiently faint that they can be 
difficult to isolate from other stars. In a clever twist, astronomers have 
used repeated, deep observations with the HST to separate faint cluster 
stars from intervening and background objects''*"’. The stars in the 
cluster all share a common velocity, causing them to show slight shifts 
in position between the two epochs of observation. These shifts are 
distinct from those shown by foreground and background stars, 
allowing the cluster stars to be isolated down to very faint luminosities 
with little confusion. The resulting measurements of the distribution 
of white dwarf brightnesses are tremendously precise, and have placed 
secure lower limits on the age of the Universe that do not depend on 
uncertain observations at high redshift or assumptions about the 
underlying cosmology. Instead, they rely solely on our understanding 
of the physics of stars, making this a superb independent check on a 
fundamental cosmological measurement. This measurement could 
not have been made without the high resolution and the photometric 
(and astrometric) stability of the HST. 

An independent probe of the Universe’s age comes from the HST’s 
measurement of the Hubble constant using Cepheid distances. In 
addition to providing a fundamental distance scale for the 
Universe, the value of the Hubble constant is directly related to the 
rate at which the Universe is expanding at the present day. Its value is 
therefore critical in extrapolating between the present and the Big 
Bang, and thus in establishing the duration of the expansion. 
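
The link between the Hubble constant and the duration of the expansion can be illustrated with the 'Hubble time', 1/H₀, which would equal the age of the Universe only if the expansion rate had never changed. The sketch below evaluates it for three assumed values of H₀: 50 and 100 km s⁻¹ Mpc⁻¹ to represent a factor-of-two uncertainty, and 72 km s⁻¹ Mpc⁻¹ as an illustrative intermediate value. The real age depends on the full expansion history, so these numbers are indicative only.

```python
KM_PER_MPC = 3.0857e19     # kilometres per megaparsec
SEC_PER_GYR = 3.15576e16   # seconds in 1e9 Julian years


def hubble_time_gyr(h0_km_s_mpc: float) -> float:
    """Hubble time 1/H0 in Gyr, for H0 given in km/s/Mpc."""
    seconds = KM_PER_MPC / h0_km_s_mpc
    return seconds / SEC_PER_GYR


# Assumed illustrative values of H0, not measurements quoted in the text.
for h0 in (50.0, 72.0, 100.0):
    print(f"H0 = {h0:5.1f} km/s/Mpc -> 1/H0 = {hubble_time_gyr(h0):.1f} Gyr")
```

A factor-of-two uncertainty in H₀ translates directly into a factor-of-two uncertainty in this timescale, which is why narrowing the distance scale mattered so much for comparisons with stellar ages.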

Whereas the Hubble constant constrains the rate of the Universe’s 
expansion at the present day, astronomical observations can also be 
used to constrain the Universe’s past expansion. Our understanding 
of the past rate of expansion has been revolutionized by the study of 
the brightnesses of high-redshift supernovae. Progressively more dis- 
tant supernovae should be increasingly fainter, in a way that depends 
sensitively on the exact model of the Universe in which we live. 
Measurements of distant supernovae have shown that they appear 
unusually faint''*'**. However, the typical luminosities of super- 
novae are well understood, and their apparent faintness must thus 
indicate that they are much farther away than had been anticipated in 
the early 1990s. For this to be true, the Universe must be expanding 
far faster than in any model that includes only relatively normal 
forms of matter. This evidence for ‘dark energy’ or ‘cosmological 
acceleration’ was one of the ground-breaking scientific discoveries 
of the late 1990s. 
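
The step from 'fainter than expected' to 'farther than expected' follows from the distance modulus, m − M = 5 log₁₀(d / 10 pc), which relates apparent magnitude m, absolute magnitude M and distance d. The sketch below inverts this relation; the 0.25-magnitude offset used in the example is an assumed illustrative figure, and the calculation ignores dust and the cosmological corrections that real analyses apply.

```python
def distance_pc(apparent_mag: float, absolute_mag: float) -> float:
    """Invert the distance modulus m - M = 5 log10(d / 10 pc)."""
    return 10.0 ** ((apparent_mag - absolute_mag + 5.0) / 5.0)


def distance_ratio(delta_mag: float) -> float:
    """Factor by which distance grows if a source is delta_mag fainter."""
    return 10.0 ** (delta_mag / 5.0)


# Assumed example: a standard candle appearing 0.25 mag fainter than
# predicted must be roughly 12% farther away than predicted.
print(f"distance ratio for 0.25 mag: {distance_ratio(0.25):.3f}")
```

Because the relation is logarithmic, even a modest excess in faintness corresponds to a substantial change in inferred distance, which is why precise photometry of distant supernovae is so valuable.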

The HST has been essential in discovering and monitoring ever 
more distant supernovae, which probe early epochs in the Universe’s 
expansion. At such large distances, it is difficult to measure the prop- 
erties of supernovae from the ground, owing to their faintness and 
the blurring of their light with the surrounding galaxy. With the HST, 
however, the supernovae can be separated cleanly from the host 
galaxy, allowing the brightnesses to be measured. These precisely 
measured brightnesses constrain the distances to the supernovae, 
and limit the nature of the expansion at these early times, providing 
a strong limit on possible models for the nature of the dark energy. 

Supernova measurements of the dark energy suggest that it dom- 
inates the mass-energy density of the Universe, but there is still an 
enormous quantity of dark matter pervading space. Although its true 
nature eludes us, its mass and distribution can be probed through 
indirect techniques. One of the most successful techniques is through 
the distortion that dark matter produces in images of background 
galaxies. Like the bottom of a wine glass, the mass of 
dark matter distorts light passing through it, producing a warped 
view of all the objects seen through the dark matter. Normally, this 
effect is too small to be measured, but for the densest concentrations 
of dark matter, found in massive clusters of galaxies, the effect can be 
dramatic, producing enormous arcs and ring-like structures around 
the clusters'”*. These images are among the most spectacular extra- 
galactic images produced by HST. 

With the precision offered by the HST’s resolution, astronomers 
have mastered techniques that use the observed distortions to recon- 
struct the distribution of mass that produced them. This makes it 
possible to map the dark matter distribution, in spite of not being 
able to see it directly’**"'”°. One of the most striking uses of this 
technique can be seen in recent observations of two colliding galaxy 
clusters (the ‘bullet cluster’). Maps of the dark matter constructed 
from HST images show a remarkable separation of the dark matter 
from the gas in the cluster'”’. Like two colliding streams of water, the 
gas was essentially stopped in the collision, whereas the dark matter 
and the galaxies passed through. These observations are striking con- 
firmation that the dark matter is not associated with the gas (which 
dominates the normal matter in the clusters), and that the dark 
matter acts like ‘collisionless’ particles that interact only through 
gravity and not any electromagnetic process. Thus, in spite of not 
knowing what the dark matter is, the HST has allowed us to learn 
what it is not, and how it behaves on large scales. 


Gas in the Universe 


Although cosmology tells us that the majority of the mass in the 
Universe is hidden in the dark sector, nearly all the light we see comes 
from normal, familiar forms of matter. This matter is only a small 
fraction of the Universe’s mass, but understanding what forms it 
takes and how it is distributed through space constrains many astro- 
physical questions. 

The majority of normal baryonic matter is thought to be in the gas 
phase, and can produce both emission lines and absorption lines in 
astrophysical spectra. The location (in wavelength) and amplitude of 
these lines can be used to constrain the chemical composition, tem- 
perature, density and pressure of the gas. Although optical spectro- 
scopy from the ground can routinely study some of the spectroscopic 
features produced by astrophysical gases, many of the most useful 
features are found at ultraviolet wavelengths, and can only be studied 
from space. 
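
The dependence on wavelength can be made explicit with the redshift relation λ_obs = λ_rest(1 + z). The sketch below applies it to the hydrogen Lyman-α line (rest wavelength 1215.67 Å) and asks at what redshift the line moves longward of the atmospheric cutoff. The cutoff of 3,200 Å is an assumed approximate value, and the function names are ours.

```python
LYMAN_ALPHA_REST_A = 1215.67     # rest wavelength of hydrogen Lyman-alpha, angstroms
ATMOSPHERIC_CUTOFF_A = 3200.0    # assumed approximate ground-based ultraviolet limit


def observed_wavelength_a(rest_a: float, z: float) -> float:
    """Observed wavelength of a line emitted or absorbed at redshift z."""
    return rest_a * (1.0 + z)


def min_redshift_from_ground(rest_a: float,
                             cutoff_a: float = ATMOSPHERIC_CUTOFF_A) -> float:
    """Lowest redshift at which the line is redshifted past the cutoff."""
    return max(cutoff_a / rest_a - 1.0, 0.0)


z_min = min_redshift_from_ground(LYMAN_ALPHA_REST_A)
print(f"Lyman-alpha reaches {ATMOSPHERIC_CUTOFF_A:.0f} A at z = {z_min:.2f}")
```

Under these assumptions, hydrogen absorbers at redshifts below roughly 1.6 leave their Lyman-α signature in the ultraviolet, out of reach of ground-based spectrographs.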

Ultraviolet spectroscopy with the HST has allowed astronomers to 
track where gas is and how the amount of matter in the gas phase has 
changed with time. One of the earliest HST Key Projects mapped the 
distribution of gas in the (relatively) nearby Universe using the 
absorption lines produced by hydrogen along the line of sight to 
more distant quasars'**. Gas had previously been tracked in this 
manner, but only at larger distances that probe the properties of 
the Universe long ago. The evolution of the gas phase could therefore 
not be tracked continuously to the present, leaving the final destina- 
tion of the gas uncertain. Moreover, when restricted to such large 
distances, little was known about exactly what astronomical objects 
the gas was associated with. Was it bound to galaxies? Or was it 
distributed in the otherwise empty volume of space between galaxies? 
Only with the HST’s ultraviolet sensitivity could the amount and 
distribution of this gas be tracked to the present day, through the 
local distribution of galaxies. The HST thus made it possible to follow 
the evolution of the primary gas reservoir in the Universe from the 
earliest times to the present'”"'** and, moreover, to match the gas 
seen in absorption to other nearby galaxies'**"'**. These early studies 
showed that there was rapid evolution in the amount of broadly 
distributed gas, as it was absorbed into galaxies, and that there were 
multiple populations of absorbing clouds, some that were clearly 
associated with galaxies and others that resided in the ‘cosmic 
web’. More recently, ultraviolet spectroscopy with the HST has 
uncovered an even larger reservoir of gas'*’, which simulations'”° and 
subsequent ultraviolet observations with NASA’s Far Ultraviolet 
Spectroscopic Explorer'*” suggest is hiding in a hot, diffuse plasma 
surrounding nearby galaxies. This ionized gas may contain even 
more mass than is currently found in stars and cool gas. 

In addition to tracking the broad movement of gas from the cos- 
mic web into galaxies, the HST has been essential in tracing the 
detailed properties of gas within individual galaxies, and particularly 
within the Milky Way. Ultraviolet spectroscopy of nearby stars shows 
a rich network of absorption lines produced by several dozen differ- 
ent elements found in the gas in our galaxy. By analysing the depths of 
various absorption features, astronomers have shown in detail how 
different elements move in and out of the gas phase as they are 
expelled by stars and/or condense into solid dust grains'”’. 
Tracking these changes improves our understanding both of how 
these elements are produced deep within stars and of the properties 
of the dust which clouds almost every astronomical observation. 


A new way forward for astronomy 


Beyond the direct scientific impact of its observations, the HST has 
had a significant role in shaping the culture of astronomy. In many 
scientific fields, data are proprietary, and are owned indefinitely by 
the groups who performed an individual experiment. Ground-based 
astronomy was no different in this regard. The HST, however, was 
recognized to be a sufficiently unique and limited resource as to make 
an indefinite proprietary period unjustifiable. Instead, any data taken 
by the HST are open for anyone to use after only one year. This policy 
is an effective multiplier for the HST’s impact, allowing all data to be 
used by multiple groups, for multiple purposes, facilitated by the 
HST’s excellent archiving programme. 

This movement towards non-proprietary data was greatly accel- 
erated by the HDF, which was imaged using an unprecedented alloca- 
tion of observing time donated by the director of the Space Telescope 
Science Institute, Bob Williams. The images were reduced by staff at 
the institute and released to the public immediately. The wider astro- 
nomical community responded by investing their own resources in 
spectroscopy and follow-up imaging at other wavelengths, using 
other facilities. This coming together of the community to generate 
a shared, non-proprietary data set was essentially unprecedented, but 
has since become the model for the majority of large astronomical 
projects. Almost all major astronomical surveys are now proposed 
with the expectation that the data and data products will be publicly 
released during the project, rather than held in perpetuity by those 
few who instigated the programme. This new mode of operating has 
democratized astronomy by opening astronomical research to scientists 
at relatively under-resourced institutions, allowing 
researchers at small colleges, or in poor countries, to have access to 
some of the finest data sets in the world. The policies established by 
the HST’s oversight board were crucial steps in ushering in this 
cultural change. 


The future of the HST 


Owing to the HST’s upcoming servicing mission, the installation’s 
future looks as bright as its past. With increased imaging sensitivity at 
ultraviolet and near-infrared wavelengths, enhanced ultraviolet spec- 
troscopy and the restoration of several failed instruments, the HST 
will surpass any capability it had in the past. Even as astronomers 
master the art of achieving high-resolution imaging from the ground 
through adaptive optics, the HST’s photometric stability, ultraviolet 
sensitivity and low background will remain unparalleled. 
Unfortunately, when the HST is finally decommissioned, the window 
of high-resolution optical and ultraviolet imaging will be sadly closed 
for decades to come. 


Cell-cycle restriction limits DNA damage and maintains self-renewal of leukaemia stem cells 


Andrea Viale, Francesca De Franco, Annette Orleth, Valeria Cambiaghi, Virginia Giuliani, Daniela Bossi, 
Chiara Ronchini, Simona Ronzoni, Ivan Muradore, Silvia Monestiroli, Alberto Gobbi, Myriam Alcalay, 
Saverio Minucci & Pier Giuseppe Pelicci 


Rare cells with the properties of stem cells are integral to the development and perpetuation of leukaemias. A defining 
characteristic of stem cells is their capacity to self-renew, which is markedly extended in leukaemia stem cells. The 
underlying molecular mechanisms, however, are largely unknown. Here we demonstrate that expression of the cell-cycle 
inhibitor p21 is indispensable for maintaining self-renewal of leukaemia stem cells. Expression of leukaemia-associated 
oncogenes in mouse haematopoietic stem cells (HSCs) induces DNA damage and activates a p21-dependent cellular 
response, which leads to reversible cell-cycle arrest and DNA repair. Activated p21 is critical in preventing excess 
DNA-damage accumulation and functional exhaustion of leukaemic stem cells. These data unravel the oncogenic potential of 
p21 and suggest that inhibition of DNA repair mechanisms might function as a potent strategy for the eradication of the slowly 
proliferating leukaemia stem cells. 


The number of times a single HSC replicates during a lifespan in mammals 
is limited (about 80-200), suggesting that self-renewal of HSCs is 
intrinsically restricted. This limit becomes experimentally evident as 
exhaustion of their regenerative potential when HSCs are induced to 
proliferate rapidly in response to environmental stress, such as in serial 
transplantation or myelosuppressive chemotherapy. Under these conditions, 
cell-cycle restriction of HSCs might become critical, because in 
mice selectively deficient for the cell-cycle inhibitor p21Cip1/Waf1 
(p21−/−), HSCs proliferate more and are rapidly consumed. 

Like normal blood cells, leukaemias are maintained by rare stem 
cells (leukaemia stem cells (LSCs)), endowed with the capacity to 
self-renew and to generate many leukaemic blasts. Self-renewal of LSCs is 
virtually unlimited, as inferred from their ability to support continuous 
expansion of the leukaemic clone and to be propagated inexhaustibly 
in mice during serial bone marrow transplantation. It is not clear, 
however, whether specific mechanisms are selected during leukaemogenesis 
to protect hyper-proliferating LSCs from exhaustion. This 
paper investigates the role of p21Cip1/Waf1 (hereafter called p21) in 
the regulation of LSC self-renewal in mice, using PML-RAR, the 
initiating oncogene of human acute promyelocytic leukaemia 
(APL), one subtype of acute myeloid leukaemia (AML). 


p21 maintains self-renewal of PML-RAR HSCs 


As model systems, we used mice expressing the PML-RAR complementary 
DNA from the endogenous cathepsin-G promoter ('knock-in' 
PML-RAR mice), backcrossed in the wild-type (WT) or p21−/− 
C57BL6 backgrounds (PR and PRp21−/− mice). PR mice develop 
APL after a long latency and at a low frequency, suggesting that other 
genetic alterations cooperate with PML-RAR. We analysed stem 
cells in the bone marrow of pre-leukaemic and leukaemic mice. 


We first quantified the frequency of HSCs in the bone marrow of 
pre-leukaemic 2- to 3-month-old mice by immunophenotyping 
and limiting dilution. A unique cell surface phenotype 
(Lin−Kit+Sca1+Flk2−CD34−) identifies a sub-population that is 
highly enriched with cells capable of long-term reconstitution of the 
haematopoietic system (long-term reconstituting HSCs (LT-HSCs); 
see Fig. 1a and Supplementary Fig. 2 for representative 
fluorescence-activated cell sorting (FACS) profiles and expression of 
PML-RAR in HSCs). For limiting dilutions, we performed competitive 
transplantation of three concentrations of test bone marrow cells 
(500 × 10³, 100 × 10³ and 10 × 10³) using the CD45 (Ly5.1 or Ly5.2) 
congenic system, and estimated the frequency of competitive 
re-populating units (CRUs) by Poisson statistics. The calculated frequencies of 
WT LT-HSCs (Fig. 1b, c) and CRUs (Fig. 1c and Supplementary Fig. 3) 
were, respectively, 1:8.4 × 10³ and 1:21 × 10³, as reported. 
Although in the p21−/− bone marrow we observed comparable 
frequencies, in the PR bone marrow the frequency of LT-HSCs was slightly 
diminished (1:14.1 × 10³; Fig. 1b, c), although CRUs were unaffected 
(1:21 × 10³; Fig. 1c and Supplementary Fig. 3). This apparent 
discrepancy might reflect different sensitivities of the two assays. Surprisingly, 
in the PRp21−/− bone marrow we found normal frequencies of 
LT-HSCs (1:7.0 × 10³) and a marked decrease of CRUs (less than 
1:64 × 10³; Fig. 1c and Supplementary Fig. 3). Thus, p21 loss or 
PML-RAR expression does not significantly affect the establishment 
or maintenance of HSC reserves, whereas p21 expression is critical in 
maintaining numbers of functional HSCs in the presence of PML-RAR. 

To investigate the functional properties of PRp21−/− HSCs, we 
evaluated their capacity to re-populate an irradiated host by competitive 
bone marrow transplantation. Quantification of bone marrow 
reconstitution of mice inoculated with about one PRp21−/− CRU 

Figure 1 | p21 maintains self-renewal of PML-RAR-expressing HSCs. 
a, b, LT-HSCs (LT), ST-HSCs (ST), multipotent progenitors (MPP) and CMP: 
representative profiles (a) and frequencies (b) of indicated genotypes (mean 
and s.d. of five experiments; four mice per group). c, Frequency of LT-HSCs 
and CRUs. d, e, Frequency of peripheral blood or bone marrow donor-derived 
cells after bone marrow competition (d) or serial (e) transplantation, 
respectively. Like others, we did not observe premature exhaustion of p21−/− 
HSCs using the C57BL6 strain, as instead reported for the 129Sv strain. f, Frequency 
of donor bone marrow LT-HSCs 8 weeks after transplantation using equivalent 
LT-HSC numbers (mean and s.d. of two experiments; three mice per genotype). 


(100 × 10³ test bone marrow cells in the limiting dilution assay 
described above) revealed a slightly higher number of re-populating 
units (PRp21−/− to WT RU ratio = 1.7). Similarly, the re-populating 
unit ratio in a 1:1 admixture (1 × 10⁶ cells) of PRp21−/− and 
competing WT bone marrow cells was 1.6 (P < 0.05), with 62.5% of 
PRp21−/− cells in the peripheral blood of long-term reconstituted 
animals (Fig. 1d and Supplementary Fig. 4a). Thus, although numbers 
of functional HSCs are reduced in the PRp21−/− bone marrow, 
they possess normal, or slightly increased, re-populating capacity. 
To investigate alternative mechanisms for the reduced pool of functional 
HSCs in the PRp21−/− bone marrow, we analysed their lifespan 
by serial transplantation. Briefly, test Ly5.2+ bone marrow cells were 
injected into irradiated Ly5.1+ mice, analysed 3-4 months 
post-transplant and re-transplanted. Bone marrow reconstitution with 
WT, PR or p21−/− donor cells was about 50% after the third transplant 
and, strikingly, only about 8% with the PRp21−/− bone marrow 
(Fig. 1e), indicating premature exhaustion of the PRp21−/− HSCs. 
To determine whether the functional decline of PRp21−/− HSCs 
was due to diminished self-renewal capacity, we assayed their proliferative 
potential in primary transplant recipients and in vitro. Equal 
numbers of test Ly5.2+ LT-HSCs were injected into irradiated Ly5.1+ 
mice, and bone marrow was analysed immunophenotypically after two 
months. The number of donor-derived PRp21−/− LT-HSCs was 
about sixfold lower than that of PR LT-HSCs (P < 0.01; Fig. 1f). 
Likewise, the total progeny-cell output of PRp21−/− HSCs after 
short-term culture was significantly reduced (about fourfold; 
Supplementary Fig. 5), demonstrating that PRp21−/− HSCs have 
diminished self-renewal capacity. Notably, p21 loss had no significant 
effects on the proliferation of LT-HSCs in the absence of PML-RAR 
expression, either in vivo (Fig. 1f) or in vitro (Supplementary Fig. 5). 
Cumulatively, these results suggest that p21 is critical to the 
self-renewal of PML-RAR-expressing HSCs and to protection from functional 
exhaustion in stressful conditions. Because PML-RAR expression 
or p21 loss alone did not reduce the self-renewal capacity of HSCs, their 
synergistic effect suggests that they affect a single function or pathway. 


p21 limits DNA-damage of PML-RAR HSCs 


Functional exhaustion of HSCs was recently described in mice deficient 
in several genomic-maintenance pathways, owing to accumulation of 
genomic damage. Thus, we examined whether PML-RAR expression 
and/or p21 loss induce accumulation of DNA damage by immunostaining 
for phosphorylation of histone H2AX (γH2AX), as an indicator 
of the cellular response to DNA damage. Expression of PML-RAR in 
U937 haematopoietic cells induced marked accumulation of γH2AX 
foci (Fig. 2a), suggesting that the fusion protein induces DNA damage. 
The same effect was observed in the HSC-enriched populations of murine 
(lineage depleted: Lin−; Fig. 2b) or human (CD34+; Fig. 2c) origin. 

We then investigated the effect of PML-RAR and p21 on DNA 
integrity in vivo, by quantifying γH2AX foci in the LT-HSCs of 2- to 
3-month-old mice (Fig. 2d and Supplementary Fig. 6). Whereas rare 
γH2AX foci were observed in WT LT-HSCs (about 70% of cells with 
no more than two foci per cell; modal number = 0), as reported, 
most (about 70%) of the PR LT-HSCs presented multiple foci (at 
least two; modal number = 5). p21 loss also led to increased 
LT-HSCs with γH2AX foci, although at a lower frequency (about 60% 
of cells with at least two foci per cell; modal number = 1). Strikingly, 
PRp21−/− LT-HSCs were almost homogeneously positive for 
anti-γH2AX staining (more than 95% of cells with at least two foci per 
cell; modal number = 7). Together, these results demonstrate that 
either PML-RAR expression or p21 loss induces a moderate degree of 
DNA damage in LT-HSCs, which increases significantly when both 
conditions are present, suggesting that p21 limits DNA-damage 
accumulation in PML-RAR-expressing HSCs. 
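
The per-genotype summaries quoted above (share of cells with at least two foci, modal number of foci) are simple descriptive statistics. As a minimal sketch, assuming foci have already been counted for each scored cell, they could be computed as follows. The function name and the example counts are hypothetical and are not data from this study. 

```python
from collections import Counter


def summarise_foci(foci_per_cell, threshold=2):
    """Return (fraction of cells with >= threshold foci, modal foci number).

    foci_per_cell: one non-negative integer per scored cell.
    Ties for the mode are resolved in favour of the smallest value.
    """
    if not foci_per_cell:
        raise ValueError("at least one scored cell is required")
    counts = Counter(foci_per_cell)
    highest = max(counts.values())
    modal_number = min(n for n, c in counts.items() if c == highest)
    positive = sum(1 for n in foci_per_cell if n >= threshold)
    return positive / len(foci_per_cell), modal_number


# Hypothetical counts for ten cells (illustration only, not study data).
example_counts = [0, 0, 0, 1, 0, 2, 0, 3, 1, 0]
fraction, mode = summarise_foci(example_counts)
print(f"{fraction:.0%} of cells with at least two foci; modal number = {mode}")
```

The threshold of two foci mirrors the cut-off used in the text; it is exposed as a parameter because other scoring conventions are possible. 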

The synthetic interaction between p21 loss and PML-RAR expression 
on HSC functions becomes evident when HSCs are induced to proliferate, 
as occurs after bone marrow transplantation. Because hyper-proliferation 
causes DNA-replication stress and genomic damage, we 
investigated DNA-damage accumulation in actively proliferating HSCs. 
Analysis of γH2AX foci in cultured LT-HSCs showed a time-dependent 
increase in DNA damage in both PR and PRp21−/− samples, which was 
significantly higher in the latter (Fig. 2e). Thus, p21 limits accumulation 
of DNA damage during proliferation of PML-RAR-expressing HSCs. 


p21 is upregulated by PML-RAR and induces cell-cycle restriction 
of HSCs 

p21 is upregulated by DNA damage or other types of stress and 
induces reversible cell-cycle arrest, allowing for DNA repair or stress 
resistance. We thus investigated the effects of PML-RAR on p21 
expression and its consequences on the cell cycle. 

Protein and/or RNA analyses showed increased levels of p21 in 
U937 (Supplementary Fig. 7a-c) and Lin− (Fig. 3a) cells after PML-RAR 
expression, as well as in the LT-HSCs of the PR bone marrow 

Figure 2 | Fusion proteins induce DNA damage. Anti-γH2AX 
immunofluorescence: a, different U937 clones (α-γH2AX: green; DAPI: blue; 
×40); b, Lin− cells infected with empty (control), PML-RAR or 
AML1-ETO-expressing green fluorescent protein (GFP) retroviruses (GFP: green; 
α-γH2AX: red; DAPI: blue; ×125); c, CD34+ cells infected with empty (control), 
PML-RAR or AML1-ETO retroviruses (α-γH2AX: green; DAPI: blue; ×125); 
d, LT-HSCs of indicated genotypes (α-γH2AX: green; DAPI: blue; ×125; left), 
numbers of γH2AX foci per cell (right) (error bars denote s.d.); e, PR or PRp21−/− 
LT-HSCs before (day 0) or after 4 days of culture (α-γH2AX: green; DAPI: blue; 
×125; left). Frequencies of cells showing diffuse staining (arrow marks one 
representative cell) (right). Mean and s.d. of three experiments are shown. 


(Fig. 3b), demonstrating that PML-RAR upregulates p21. Notably, 
we also observed increased levels of p21 expression in freshly isolated 
blasts from murine (Fig. 3b) or human (Fig. 3c) APLs, indicating that 
the effect of PML-RAR on p21 is retained in the fully transformed 
APL blasts. This effect was surprising, because PML-RAR downregulates 
p53, a critical activator of p21 expression. Analysis of the 
effects of PML-RAR on p21 in p53−/− Lin− cells showed that, indeed, p21 
upregulation by PML-RAR is p53 independent (Fig. 3a). 

The effects of p21 on the cell cycle of HSCs were investigated in vivo 
using the 5-bromodeoxyuridine (BrdU) incorporation test. The 
fraction of BrdU-positive LT-HSCs was decreased in the PR bone marrow 
(P < 0.05), whereas it was increased in the PRp21−/− bone marrow 
(P < 0.01) (Fig. 3d and Supplementary Fig. 8a), demonstrating that p21 
induces cell-cycle restriction of PML-RAR-expressing LT-HSCs. This 

Figure 3 | p21 upregulation and cell-cycle restriction in HSCs. 
a, Quantitative PCR (qPCR) (left; mean and s.d. of three experiments) and 
confocal microscopy (right) of p21 expression in Lin− cells infected with empty 
(control), PML-RAR (PR) or AML1-ETO (AE) GFP-retroviruses (α-p21: red; 
DAPI: blue; GFP: green; pixel width: 103-132 nm). b, c, qPCR of p21 
expression in MNCs, bone marrow sub-populations and mouse PML-RAR 
leukaemias (relative to WT MNCs) (b) and in healthy CD34+ cells and 
AML-patient blasts (expressing AML1-ETO, CBFB-MYH11, PML-RAR or with 
normal karyotypes) (c). d, BrdU+ cells in bone marrow sub-populations (mean 
and s.d. of three experiments; three or four mice per genotype). e, Survival 
curves of PR (n = 18) and PRp21−/− (n = 11) mice during 5-FU treatment. 


effect of p21 correlates with the observed reduction in LT-HSC 
frequency in the PR bone marrow (Fig. 1b, c). For functional confirmation, 
we measured the sensitivity of PR and PRp21−/− mice to 
myelosuppression, using a cell-cycle-specific drug (5-fluorouracil, 
5-FU). The survival of PRp21−/− mice was significantly lower 
than that of the PR mice (Fig. 3e), thus confirming that the frequency of 
cell-cycle-restricted HSCs is lower in the PRp21−/− bone marrow. 

Together, these data demonstrate that expression of PML-RAR in 
HSCs results in upregulation of p21, which, in turn, induces cell-cycle 
restriction. They also suggest that this function of p21 limits DNA 
damage accumulation by favouring DNA repair. It remains unclear, 
however, whether the effect of PML-RAR on p21 is direct, the 
consequence of PML-RAR-induced DNA damage, or both. 


p21 prevents hyper-proliferation, severe DNA-damage and 
exhaustion of LSCs 

We then investigated whether p21 expression and cell-cycle restriction 
are critical in maintaining self-renewal of LSCs. PR and PRp21−/− mice 
developed APL with comparable latency, frequency and morphology 
(Fig. 4a, b), indicating that p21 is dispensable for leukaemogenesis by 
PML-RAR. Strikingly, although the PR and PRp21−/− blasts showed 
comparable homing properties (Supplementary Fig. 9), none of the 
PRp21−/− leukaemias was able to propagate the disease when transplanted 
into syngeneic recipients (Fig. 4c). To confirm these results in 
another model system of APL, WT and p21−/− Lin− cells were infected 
with a PML-RAR-encoding retrovirus and transplanted into recipient 
mice. Both groups of mice reconstituted haematopoiesis and developed 
APL with comparable latency and frequency (Supplementary Fig. 10a-c). 
Again, however, no p21−/− APL was transplantable (Supplementary 
Fig. 10d), confirming that p21 is required for PML-RAR LSCs to support 
leukaemia outgrowth after transplantation. 

We next examined the effects of p21 on numbers, cell-cycle properties 
and DNA damage of LSCs, by analysing a sub-population of Lin− blasts 
that expresses the c-Kit and Sca1 HSC markers (LKS+; Fig. 4d) and is 
enriched with cells capable of propagating leukaemias upon transplantation 
(unpublished observations). Remarkably, PRp21−/− LKS+ cells 
were reduced in number (about 50-fold; Fig. 4d), showed increased 
BrdU incorporation (about threefold; Fig. 4e and Supplementary Fig. 
8b) and increased DNA damage (Fig. 4f). Thus, as in pre-leukaemic 
HSCs, p21 expression in LSCs imposes cell-cycle restriction, limits 
accumulation of DNA damage and prevents their exhaustion. 


p21 response to PML-RAR expression is specific to stem cells 


Enforced expression of activated oncogenes in primary cells, such as 
fibroblasts, induces DNA damage and triggers a DNA-damage response 
that leads to a halt in proliferation, owing to apoptosis or 
irreversible cell-cycle arrest (senescence), thus preventing propagation 
of DNA alterations. This cellular response depends on p53 
and is considered a powerful barrier to tumour development. As with 
other oncogenes, expression of PML-RAR induces DNA damage in 
HSCs, without, however, leading to exclusion of damaged cells. 
Indeed, DNA is damaged in PML-RAR-expressing HSCs (Fig. 2d), 
yet they maintain intact self-renewal (Fig. 1f). We investigated 
whether this response to PML-RAR is specific to HSCs, by analysing 
its effects on downstream progenitors and fibroblasts. 


[Figure 4 panel residue: survival over time (months) for PR and PRp21−/− leukaemias; c-Kit profiles and bone marrow (BM) frequency (%) of PML-RAR leukaemias, p21+/+ versus p21−/−. See the Figure 4 legend.] 


NATURE | Vol 457 | 1 January 2009 


Expression of PML-RAR in common myeloid progenitors (CMPs) 
induced accumulation of γH2AX foci (Fig. 5a) and upregulation of 
p21 (data not shown). Notably, the effect of the fusion protein on 
DNA integrity was independent of p21 (Fig. 5a). Immunophenotypic 
and BrdU-incorporation analyses showed, in both PR and PRp21−/− 
bone marrow, increased CMP numbers (Fig. 1b) in the absence of any 
significant effect on their cycling properties (Fig. 3d). This effect of 
PML-RAR on CMP numbers was confirmed by the colony-forming 
cell (CFC) assay (Fig. 5b) and might explain the better performance of 
PR bone marrow after competitive transplantation (Fig. 1d). Because 
PML-RAR inhibits apoptosis of haematopoietic progenitors by 
inhibiting p53 (ref. 27), the expansion of CMPs in PR bone marrow 
might be due to their prolonged survival. Thus, upregulation of p21 
in myeloid progenitors has no effect on their cycling properties, nor 
does it regulate DNA-damage accumulation. Likewise, we observed 
no effects of p21 expression on the frequency (Fig. 4d), cell-cycle 
properties (Fig. 4e) and extent of DNA damage (Fig. 5c) of sub- 
populations of leukaemia blasts that do not express stem-cell markers 
(LKS−). 

Expression of PML-RAR in mouse embryonic fibroblasts (MEFs) 
and human (WI-38) fibroblasts also induced accumulation of 
γH2AX foci and upregulation of p21 (Fig. 5d and data not shown). 
In both cell types, PML-RAR blocked proliferation and induced 
β-galactosidase positivity (Fig. 5e-g and data not shown), a marker 
of cellular senescence. This effect of the fusion protein was p21 
dependent and p53 independent, as shown by the opposite behaviours 
of p21−/− or p53−/− MEFs after PML-RAR expression 
(Fig. 5e, f). These data demonstrate that although PML-RAR induces 
DNA damage and p21 upregulation in all tested target cells, p21 
mediates different cellular responses in HSCs (cell-cycle restriction) 
and fibroblasts (permanent cell-cycle arrest). Thus, the p21 property 


Figure 4 | p21 limits DNA damage in LSCs and is 
indispensable for leukaemogenesis. a, Survival 
of PR (n = 33) and PRp21−/− (n = 26) mice. 
b, MGG blood smears of leukaemic mice. 
c, Survival of secondary recipients (four each) 
injected with PR (n = 7) or PRp21−/− (n = 7) 
leukaemias. d, Leukaemic LKS+ and LKS− sub-populations: 
representative profiles (left) and 
frequencies (right) of indicated samples (mean of 
four genotypes and s.d.). e, In vivo BrdU 
incorporation in LKS+ and LKS− sub-populations 
(mean of three leukaemias and s.d.). 
f, Anti-γH2AX staining of LKS+ from PML-RAR 
leukaemias (α-γH2AX: green; DAPI: blue; 
×125). g, h, PCR of PB DNA for retrovirus 
integrations (g) and survival (h) of mice injected 
with WT (n = 18) or p21−/− (n = 17) 
AML1-ETO-infected Lin− cells. 

of allowing expansion of DNA-damaged cells in response to PML-RAR 
expression is specific to HSCs. It remains unclear, however, 
whether this reflects a physiological response of HSCs to DNA 
damage or a selected property of the PML-RAR oncogene in HSCs. 


p21 is indispensable for leukaemogenesis by AML1-ETO 


AML1-ETO, another AML-specific oncogene, induced accumulation 
of γH2AX foci and upregulation of p21 in U937 (Fig. 2a and 
Supplementary Fig. 7a-c) and Lin− (Figs 2b and 3a) cells, and in 
MEFs (Fig. 5d). Upregulation of p21 in Lin− cells was p53 independent 
(Fig. 3a). As for PML-RAR, AML1-ETO induced cell-cycle restriction 
of U937 cells (Supplementary Fig. 7d and data not shown) and 
senescence of cultured MEFs, in a p21-dependent and p53-independent 
manner (Fig. 5e, f), although it did not arrest proliferation 
of stem cell/progenitors, as revealed, for example, by the long-term 
culture-initiating cell (LTC-IC) assay (Supplementary Fig. 4b). 
Strikingly, AML1-ETO failed to induce leukaemias when expressed 
in p21−/− Lin− cells. Lin− cells from WT or p21−/− mice were infected 
with AML1-ETO-expressing retroviruses and used to reconstitute 
recipient mice. Analysis of AML1-ETO integration in the peripheral 
blood DNA of the reconstituted mice demonstrated engraftment in all 
transplanted mice (Fig. 4g). However, 10 of 18 mice (about 55%) 
reconstituted with AML1-ETO-expressing WT Lin− cells developed 
AML with a latency of 4-7 months, as expected (Fig. 4h and 
Supplementary Fig. 10b), whereas none of the 17 mice reconstituted 
with AML1-ETO-expressing p21−/− Lin− cells developed leukaemia over a 
period of 12-14 months (Fig. 4h and Supplementary Fig. 10b), thus 
indicating that p21 is critical to the ability of AML1-ETO to initiate 
leukaemogenesis. The different effects of p21 loss on PML-RAR 
and AML1-ETO leukaemogenesis (exhaustion of established leukaemias 
or inhibition of initiation, respectively) might reflect different 
effects on the lifespans of the corresponding LSCs. Notably, the 
protocol of AML1-ETO-induced leukaemogenesis involves treatment 
of recipient mice with a potent DNA-damaging agent (nitrosourea), 
which might result in accelerated exhaustion of the 
AML1-ETO LSCs in the absence of p21. 

Figure 5 | Effects of fusion proteins on 
progenitors and fibroblasts. Anti-γH2AX 
staining of (a) PR and PRp21−/− CMP (×10) and 
(c) PML-RAR leukaemic LKS− (×30) (α-γH2AX: 
green; DAPI: blue); b, CFUs from bone 
marrow MNCs (mean ± s.d. of three 
experiments). d, Anti-γH2AX staining of MEFs 
(α-γH2AX: green; DAPI: blue; ×125). e, Growth 
curves of WT (solid symbols) and p21−/− (open 
symbols) MEFs expressing empty vector 
(control), PML-RAR or AML1-ETO 
(mean ± s.d. of three experiments). f, Same as in 
e for p53−/− MEFs (mean ± s.d. of three 
experiments). g, β-Galactosidase staining of 
WI-38 cells expressing fusion proteins. 


Conclusions 


Our data provide the first demonstration that expression of a cell-cycle 
inhibitor, such as p21, is critical for the initiation and maintenance of 
leukaemogenesis. They are consistent with a model whereby expression 
of leukaemia-associated oncogenes in HSCs induces DNA damage and 
activates a p21-dependent cellular response that, in turn, imposes cell- 
cycle restriction and triggers repair of the damaged DNA 
(Supplementary Fig. 1). This effect of p21 has two consequences, both 
of which might be relevant for leukaemogenesis. The first is the accu- 
mulation of genomically unstable HSCs. In the presence of oncogenic 
fusion proteins, HSCs accumulate moderate degrees of DNA damage, 
which represents the balance of the DNA-damaging effects of the fusion 
proteins, and, seemingly, the DNA-repair activity of upregulated p21. 
Both of these conditions are known to accelerate the rate of mutagenesis, 
thus providing an explanation for the role of p21 in initiating 
leukaemogenesis. The second consequence is a bypass of the physiological 
exhaustion of HSC self-renewal, which occurs over time owing to 
accumulation of DNA damage. This effect of p21 confers an advantage 
to HSCs when they hyper-proliferate, as occurs during stress or 
after full transformation (for example, in the LSCs), thus explaining the 
role of p21 in the maintenance of leukaemias (Supplementary Fig. 1). In 
a way, these activities of p21 represent a novel mechanistic contribution 
to tumorigenesis: overexpression of p21, in fact, is not transforming per 
se; it is, however, ‘permissive’ for leukaemia to progress. 

These data bear important implications for the treatment of leukae- 
mias. First, they imply that cell-cycle-restricted LSCs are critical for the 
initiation and/or maintenance of the leukaemic clone, suggesting that 
targeting this compartment might be critical to disease eradication. 
Although this concept was anticipated by the demonstration that 
human LSCs slowly proliferate (see also Supplementary Fig. 11a-c 
for the mouse LSCs), our data provide one underlying molecular 
mechanism, thus paving the way to novel anti-leukaemic strategies. 
Second, they suggest that inhibition of DNA repair might be synthetic 
lethal with oncogene expression. A range of selective inhibitors of 
different DNA damage response pathways have now been identified, 
which might become part of new approaches to targeted cancer therapy. 


METHODS SUMMARY 


Fusion proteins were expressed in Lin− and CD34+ cells using the PINCO 
retroviral vector. Expression of PML-RAR or AML1-ETO in PR9 or U937-AE 
cells was obtained using 100 µM ZnSO₄ (ref. 43). WT MEFs or WI-38 cells were 
infected with PML-RAR or AML1-ETO-expressing pBabe retroviruses and 
puromycin-selected for 4 days. Leukaemic mice were generated by retroviral 
transduction of PML-RAR or AML1-ETO into Lin− bone marrow cells of 
129Sv mice. PML-RAR knock-in mice were provided by T.J. Ley and 
backcrossed in the C57BL/6J background (PR). Mating PR and p21−/− C57BL/6J mice 
generated PRp21−/− mice. The various stem cell/progenitor sub-populations 
(LT-HSC, short-term HSC (ST-HSC), multipotent and common myeloid 
progenitors (MPP and CMP)) were identified by FACS or isolated by 
cell sorting after staining of mouse bone marrow mononuclear cells (MNCs) with 
fluorochrome-conjugated antibodies against Sca1, c-Kit, Flk-2, CD34 or the 
lineage markers CD11b, GR1, Ter-119, IL7-R, CD3, CD4, CD8 and B220. Frequency 
of functional LT-HSCs was determined by limiting bone marrow transplantation, 
injecting decreasing numbers of Ly5.2+ test bone marrow MNCs and a fixed 
number of WT Ly5.1+ bone marrow MNCs into irradiated Ly5.1+ mice. 
Lifespan of HSCs was determined by serial bone marrow transplantation, 
injecting Ly5.2+ test bone marrow MNCs into Ly5.1+ recipients, and re-transplanting 
bone marrow MNCs of the reconstituted mice. Self-renewal of HSCs was 
determined by an equivalent transplantation assay, measuring the frequency of Ly5.2+ 
donor-derived LT-HSCs in Ly5.1+ recipients, transplanted with stem-cell 
equivalents of test and control bone marrow MNCs. HSC re-populating potential was 
determined by a bone marrow competition assay, transplanting equal numbers of 
test Ly5.2+ and competitor Ly5.1+ HSCs. Cell-cycle properties of HSCs, LSCs or 
progenitor sub-populations were determined by the in vivo BrdU-incorporation 
test. The cell cycle of HSCs was also analysed in vivo by the 5-FU test, measuring survival 
of animals treated with weekly doses of the drug. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 


METHODS 

Mice. PML-RAR knock-in mice were provided by T.J. Ley and backcrossed in 
the C57BL/6J background (PR). p21−/− mice of mixed (C57BL/6 and 129Sv) 
background were obtained from the Jackson Laboratory and, for these studies, 
backcrossed in C57BL/6J or 129/Sv pure backgrounds (12 generations of backcrossing). 
PRp21−/− mice were generated by mating PR and p21−/− C57BL/6J mice. 129Sv 
p53−/− mice were obtained from the Jackson Laboratory. Mouse colonies were 
maintained in a certified animal facility in accordance with national guidelines. 
Limiting bone marrow transplantation. Recipient C57BL/6 Ly5.1 mice (six to 
eight per group, per genotype) were lethally irradiated (10 Gy) the day before 
transplantation, injected with 10⁴, 10⁵ or 5 × 10⁵ Ly5.2+ test bone marrow 
MNCs together with 5 × 10⁵ WT bone marrow MNCs (Ly5.1+), and analysed 
4 months post-transplant for the presence of Ly5.2+ cells in the PB. A recipient was 
defined as negative when less than 1% of PB leukocytes were Ly5.2+. CRU 
numbers were determined by Poisson statistical analysis (L-Calc software, 
StemCell Technologies). 
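
Poisson analysis of limiting dilution data is usually based on a single-hit model, in which a recipient that receives d test cells stays negative with probability exp(−f·d), where f is the CRU frequency. The sketch below is a hedged illustration of that idea and not a reproduction of the L-Calc algorithm: it finds the maximum-likelihood f by a golden-section search. The function names, the search bounds and the recipient counts are assumptions made for illustration only. 

```python
import math


def negative_log_likelihood(freq, groups):
    """Negative log-likelihood of the single-hit Poisson model.

    groups: iterable of (cells_per_recipient, n_recipients, n_negative).
    """
    total = 0.0
    for dose, n_recipients, n_negative in groups:
        expected = freq * dose
        # -log P(negative) = freq * dose
        total += n_negative * expected
        n_positive = n_recipients - n_negative
        if n_positive:
            # P(positive) = 1 - exp(-expected), evaluated stably via expm1
            total -= n_positive * math.log(-math.expm1(-expected))
    return total


def estimate_frequency(groups, lower=1e-9, upper=1e-2, iterations=200):
    """Maximum-likelihood frequency within [lower, upper].

    The negative log-likelihood is convex in freq, so a golden-section
    search on the bracket converges to its minimum.
    """
    groups = list(groups)
    inv_phi = (math.sqrt(5) - 1) / 2
    a, b = lower, upper
    for _ in range(iterations):
        c = b - inv_phi * (b - a)
        d = a + inv_phi * (b - a)
        if negative_log_likelihood(c, groups) < negative_log_likelihood(d, groups):
            b = d
        else:
            a = c
    return (a + b) / 2


# Hypothetical outcome: dose, recipients tested, recipients scored negative.
example = [(10_000, 8, 5), (100_000, 8, 1), (500_000, 8, 0)]
freq = estimate_frequency(example)
print(f"estimated CRU frequency: 1 in {1 / freq:,.0f} cells")
```

The estimate is only meaningful when the data contain both negative and positive recipients; if every recipient is negative, or every recipient is positive, the search simply runs to one of the assumed bounds. 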

Self-renewal determination. Lethally irradiated recipients (Ly5.1+) were 
transplanted with stem cell equivalents of test and control bone marrow MNCs (after 
FACS evaluation of LT-HSC frequency). Eight weeks after transplantation, 
recipients were killed and the frequency of donor-derived (Ly5.2+) LT-HSCs was evaluated 
by FACS. 

Bone marrow competition. One million test cells (Ly5.2+) (four to five per 
genotype) and 10⁶ WT competitor cells (Ly5.1+) were pooled and injected into 
lethally irradiated C57BL/6 mice (five mice per genotype). Recipients' PB was 
analysed for donor contribution by flow-cytometric analysis. Re-populating unit 
(RU) = (percentage of test chimaerism) × (number of competitor cells × 10⁻⁵)/ 
(100 − percentage of test chimaerism). 
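
As a quick illustration of the re-populating unit formula, the following sketch evaluates it for a given chimaerism. It assumes that one competitor unit corresponds to 10⁵ competitor cells, which is how the scaling factor in the formula is read here; the function name is hypothetical, and the 62.5% value is the peripheral blood chimaerism quoted in the main text. 

```python
def repopulating_units(test_chimaerism_pct, competitor_cells):
    """RU = % test chimaerism x (competitor cells / 1e5) / (100 - % test chimaerism)."""
    if not 0 <= test_chimaerism_pct < 100:
        raise ValueError("chimaerism must be in the range [0, 100)")
    # Assumption: one competitor unit equals 1e5 competitor cells.
    competitor_units = competitor_cells / 1e5
    return test_chimaerism_pct * competitor_units / (100 - test_chimaerism_pct)


ru = repopulating_units(62.5, 1_000_000)
print(f"RU = {ru:.1f}; RU per competitor unit = {ru / 10:.2f}")
```

With 62.5% chimaerism and 10⁶ competitor cells the formula evaluates to about 16.7 RU, that is, roughly 1.7 RU per competitor unit. 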

Serial bone marrow transplantation. Two million Ly5.2⁺ bone marrow test 
MNCs were injected into lethally irradiated C57BL/6 Ly5.1⁺ mice (six per genotype). 
Twelve to 16 weeks later, mice were killed, their bone marrow analysed for 
Ly5.2 donor contribution by flow cytometry and 2 × 10° MNCs re-injected into 
irradiated recipients (Ly5.1⁺). 

LTC-IC assay. Stromal layers were prepared from the WT bone marrow, cultured 
in 12.5% horse-serum IMDM supplemented with 12.5% FBS, penicillin, 
streptomycin, 1 µM hydrocortisone and 50 µM 2-mercaptoethanol at 33 °C. 
After 3 weeks, confluent stromal layers were trypsinized, irradiated (15 Gy) 
and subcultured in 6-12 multiwell plates. Cultures were then seeded with bone 
marrow MNCs (2.5 × 10° per well) or with infected and FACS-sorted Lin⁻ (10⁴ 
per well) cells. 

CFC assay. Cells derived from 5-week LTC-IC or 5 × 10* bone marrow MNCs 
were seeded in methylcellulose (StemCell Technologies) supplemented with 15% 
FCS, 2 ng ml⁻¹ IL3, 2 ng ml⁻¹ IL6, 50 ng ml⁻¹ SCF, 60 ng ml⁻¹ G-CSF and 
20 ng ml⁻¹ GM-CSF (Peprotech). Colonies were scored after 10 days. 

DNA, RNA and protein analysis. High molecular mass DNA was purified using 
the SNAP Whole Blood DNA Isolation Kit (Invitrogen). PCR analysis 
(GeneAmp, Applied Biosystems) of integration sites was performed on 200 ng 
DNA using the following primers: F-5′-AGGGACCCTATTGACGTTGAC-3′ 
and R-5′-ACAGACAAAGCAAGGCTTGTAG-3′ for PML-RAR; 
F-5′-GGCTGGCAATGATGAAAACT-3′ and R-5′-CGCCATTCAAGGCTGTAGGAG-3′ for 
AML1-ETO; F-5′-ACCACAGTCCATGCCATCAC-3′ and 
R-5′-TCCACCACCCTGTTGCTGTAG-3′ for GAPDH. Total RNA was extracted using the Midi or 
Micro RNeasy kit (Qiagen) and retro-transcribed with random hexamers 
(SuperScript First-Strand Synthesis System; Invitrogen). cDNA (6-8 ng) was 
used for RT-PCR, using the following primers: 
F-5′-CTGGGAGGGGACAAGAG-3′ and R-5′-GCTTGGAGTGATAGAAATCTG-3′ for murine p21; 
F-5′-ACCTGCCAAGTATGATGAC-3′ and R-5′-ACCACCCTGTTGCTGTAG-3′ 
for murine GAPDH; F-5′-GACTCTCAGGGTCGAAAACG-3′ and 
R-5′-GGATTAGGGCTTCCTCTTGG-3′ for human p21; 
F-5′-ATCAGCAATGCCTCCTGCAC-3′ and R-5′-TGGCATGGACTGTGGTCATG-3′ for human 
GAPDH. Western blot antibodies: α-p21 (C19, Santa Cruz), α-RAR (C20, Santa 
Cruz), α-HA (BAbCO) or α-actin (AC-40, Sigma). 

Immunofluorescence. Cells were deposited on charged slides (SuperFrost Plus, 
Menzel-Glaser), fixed (4% paraformaldehyde), permeabilized (0.1% Triton-X), 
blocked (3% BSA and 10% goat serum), stained with different antibodies 
(α-p21: SX118, DakoCytomation; α-phosphorylated Ser139 H2AX: Biolegend and 
Upstate) and counterstained with DAPI (Sigma) and Cy3- or Alexa488-conjugated 
secondary antibodies. Images were captured with a CCD (charge-coupled 
device) camera (Hamamatsu B/W CCD Camera CJ895), using a wide-field 
(Olympus BX61) or a confocal (Leica TCS SP2 AOBS) microscope with a 
×63/1.4 NA oil immersion objective (HCX PL APO ×63 Lbd BI, Leica 
Microsystems). 

Flow cytometric analyses and sorting of haematopoietic sub-populations. 
Donor contribution and multilineage reconstitution were assessed in PB samples 
by FACS analysis of the expression, respectively, of Ly5.1/Ly5.2 antigens or GR1, 
CD3 and B220 markers (eBioscience) in the Ly5.2⁺ fraction. Homing of PML-RAR 
leukaemic blasts was investigated by injecting 20 × 10° blasts (Ly5.2⁺) in 
the tail vein of Ly5.1⁺ recipient mice. After 24 h, animals were killed and bone 
marrow mononuclear cells stained with α-CD45 Ly5.2. To analyse and isolate 
bone marrow LT-HSC, ST-HSC, MPP and CMP sub-populations!''**"°, 
MNCs, after blocking with 10% BSA and 10% rat serum, were stained with 
fluorochrome-conjugated antibodies against Sca1 (PE-CY5.5), c-Kit (APC), 
Flk-2 (PE), CD34 (FITC) or the lineage markers (Cd11b, GR1, Ter-119, IL7- 
R, CD3, CD4, CD8 and B220) (PE-CY7) (eBioscience). Samples were fixed with 
2% formaldehyde (FA) for FACS analysis (FACSCanto, BD) or used fresh for cell 
sorting (FACSAria, BD). To stain leukaemic cells, we used a lineage-cocktail 
without myeloid markers, because Cd11b and GR1 are co-expressed in the 
c-Kit/Sca1 double positive blasts (unpublished observations). BrdU incorporation 
was analysed after four intraperitoneal injections (1 mg BrdU per mouse, 
every 6h, into healthy and pre-leukaemic mice; mice killed 24h after first injec- 
tion'’) or a single intraperitoneal injection (1 mg BrdU per mouse into overt 
leukaemic mice; mice killed 12 h after injection) using the FITC BrdU Flow Kit 
(BD). In this setting, surface staining was performed as described above except 
for the use of a biotinylated antibody against CD34 (eBioscience), detected by 
APC-CY7-conjugated streptavidin (eBioscience). To evaluate proliferation of 
LT-HSC, an equivalent number of sorted cells were cultured for 3-4 days in 
RPMI medium containing 10% FBS, 10 ng ml⁻¹ IL-3, IL-6, IL-11, SCF, TPO 
and FL3 ligand. 

5-FU treatment. 5-FU (TEVA) was administered weekly intraperitoneally at a 
dose of 150 mg kg⁻¹. 

PKH assay. Bone marrow leukaemic blasts, stained with PKH-26 (Sigma) and 
FACS sorted, were injected in syngeneic recipients (10 cells per mouse). When 
leukaemic, mice were killed and the bone marrow PKH⁺ cells FACS-sorted and 
re-injected into secondary recipients (10° cells per mouse). 

Statistical analysis. In vitro and in vivo data are presented as the mean ± s.d. 
(standard error). Statistical analysis was performed using a two-tailed Student’s 
t-test after the evaluation of variance. Results from survival experiments were 
analysed with a log-rank non-parametric test and expressed as Kaplan–Meier 
survival curves. 
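
For readers unfamiliar with Kaplan-Meier curves, the following minimal sketch shows how the product-limit estimate is built from survival times. It is an illustration only, not the software used for the analysis above; the function name and the input convention (`events[i]` is True for a death and False for a censored animal) are assumptions.

```python
def kaplan_meier(times, events):
    """Return [(time, survival_probability)] at each distinct death time."""
    data = sorted(zip(times, events))
    n_at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        leaving = 0
        # Group all animals sharing the same time point.
        while i < len(data) and data[i][0] == t:
            if data[i][1]:
                deaths += 1
            leaving += 1
            i += 1
        if deaths:
            survival *= 1.0 - deaths / n_at_risk
            curve.append((t, survival))
        # Dead and censored animals both leave the risk set after time t.
        n_at_risk -= leaving
    return curve
```

Censored animals lower the number at risk without producing a step in the curve, which is why the estimate differs from a simple fraction of surviving animals.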
WSTF regulates the H2A.X DNA damage 
response via a novel tyrosine kinase 
activity 


Andrew Xiao, Haitao Li, David Shechter, Sung Hee Ahn, Laura A. Fabrizio, Hediye Erdjument-Bromage, 
Satoko Ishibe-Murakami, Bin Wang, Paul Tempst, Kay Hofmann, Dinshaw J. Patel, Stephen J. Elledge 
& C. David Allis 


DNA double-stranded breaks present a serious challenge for eukaryotic cells. The inability to repair breaks leads to genomic 
instability, carcinogenesis and cell death. During the double-strand break response, mammalian chromatin undergoes 
reorganization demarcated by H2A.X Ser 139 phosphorylation (γ-H2A.X). However, the regulation of γ-H2A.X 
phosphorylation and its precise role in chromatin remodelling during the repair process remain unclear. Here we report a new 
regulatory mechanism mediated by WSTF (Williams–Beuren syndrome transcription factor, also known as BAZ1B), a 
component of the WICH complex (WSTF–ISWI ATP-dependent chromatin-remodelling complex). We show that WSTF has 
intrinsic tyrosine kinase activity by means of a domain that shares no sequence homology to any known kinase fold. We show 
that WSTF phosphorylates Tyr 142 of H2A.X, and that WSTF activity has an important role in regulating several events that 
are critical for the DNA damage response. Our work demonstrates a new mechanism that regulates the DNA damage 
response and expands our knowledge of domains that contain intrinsic tyrosine kinase activity. 


One hallmark of the mammalian DNA double-strand break (DSB) 
response is the formation of ionizing-radiation-induced foci (IRIF), 
which are composed of compacted chromatin and numerous DNA 
repair and checkpoint proteins. Despite considerable progress in 
understanding the signalling pathways leading to checkpoint control 
and DNA repair, the nature of the specialized chromatin structures at 
IRIF is not well understood. One of the earliest events occurring at 
IRIF is the phosphorylation of H2A.X (also known as H2afx), a 
specialized histone H2A variant, at Ser 139 (referred to as 
γ-H2A.X) by the ATM and ATR kinases. H2A.X-deficient mouse 
embryonic fibroblasts (MEFs) and B and T cells show pronounced 
levels of genomic instability. Class-switch recombination and spermatogenesis 
are also defective in H2A.X-deficient mice, further indicating 
its involvement in DNA damage repair. Moreover, H2A.X 
deficiency accelerates B and T cell lymphoma development in p53-deficient 
mice. Consistent with these functions in mammalian cells, 
phosphorylation of the equivalent site on yeast H2A (Ser 129) is 
found at DSB sites and spreads to ~50 kilobase (kb) of the flanking 
regions. In mammals, this phosphorylation event directly recruits 
Mdc1 and sets in motion the recruitment of other factors such as 
53BP1 (also known as Trp53bp1), Rnf8 and the Brca1 A complex. 
Furthermore, several recent studies have also indicated that ATP-dependent 
chromatin remodelling complexes are engaged in DNA 
repair pathways at these sites. The yeast NuA4 and INO80 complexes, 
for example, are recruited to the damaged chromatin by 
γ-H2A.X. 

Mammalian H2A.X differs in several respects from its counterparts in lower 
eukaryotes. For example, H2A.X is a minor H2A variant in mammalian 
cells (1-10%, ref. 3), whereas the main yeast form of H2A is most 
similar to H2A.X because it contains the signature carboxy-terminal 
sequence of mammalian H2A.X. In mammalian cells, the loss of 
SWI/SNF chromatin remodelling complex expression leads to 
defects in the H2A.X DNA damage response. However, it is unclear 
whether these defects are due to the lack of direct regulation by the 
SWI/SNF complex or to other indirect pathways. Given the unique 
features of mammalian H2A.X, we sought to find new factors that are 
directly involved in regulating H2A.X. Our results define a novel 
DNA damage response pathway regulating H2A.X function, 
mediated through the WSTF-SNF2H chromatin remodelling complex 
and the phosphorylation of H2A.X at Tyr 142 in mammals. 
Unexpectedly, we determined that the amino-terminal domain of 
WSTF, including its WAC domain, exhibits tyrosine kinase activity 
towards Tyr 142 of H2A.X. We further show that the tyrosine kinase 
activity of WSTF is required for eliciting a number of critical molecular 
events during the DNA damage response in mammalian cells. 


Regulation of Tyr 142 phosphorylation 


A role in the DNA damage response for mammalian H2A.X (γ-H2A.X) 
is well documented, although its regulation and underlying 
mechanism of action are only partially understood (reviewed in ref. 3). 
Further inspection of the C terminus of H2A.X showed a tyrosine 
(Tyr 142 in mammals) that exists in metazoans, but is absent in unicellular 
eukaryotes such as yeast (Fig. 1a). Notably, two forms of 
H2A.X exist in the Xenopus genome, which differ at this residue 
(the ‘F’ and the ‘Y’ form, Fig. 1a) and are differentially expressed during 
development (D.S. et al., manuscript in preparation). Although recent 
studies suggest a role for Tyr 142 in recruiting Mdc1 (refs 14-19), the in 
vivo function of Tyr 142, especially in the regulation of γ-H2A.X, 
remains unclear. We proposed that Tyr 142 might be phosphorylated 
in H2A.X under certain physiological conditions (Fig. 1b). 
School, Boston, Massachusetts 02115, USA. °Miltenyi Biotec GmbH, 50829 Koeln, Germany. 
Figure 1 | Tyr 142 of H2A.X is a new phosphorylation mark regulated by 
DNA damage signals. a, Comparison of the extreme C-terminal sequence 
of H2A.X demonstrates that Tyr 142 is conserved in metazoans (mammals, 
frogs and fruitflies) but not in unicellular eukaryotes. b, Primary MEF cells 
were treated with 10 Gy of ionizing radiation (IR) and recovered for a 
period of time as indicated. Acid-extracted histones were separated by 
SDS-PAGE and subjected to immunoblotting. H2A.X Tyr 142 phosphorylation 
levels in MEFs gradually decline, reaching a minimum at 8 h. The 
γ-H2A.X signal is initiated on damage and maintained up to 16 h after 
ionizing radiation. 


Preliminary studies using a pan anti-phosphotyrosine antibody indicated 
that H2A.X was phosphorylated before DNA damage (data not 
shown). To investigate further whether Tyr 142 is indeed phosphorylated, 
an antibody raised against an extreme C-terminal peptide containing 
phosphorylated Tyr 142 of H2A.X was generated and shown to 
be highly selective for H2A.X Tyr 142 phosphorylation (hereafter 
termed anti-H2A.X Tyr 142(ph); Supplementary Fig. 1). In MEF cells, 
Tyr 142 is constitutively phosphorylated under normal growth conditions 
and becomes gradually dephosphorylated during the DNA 
damage response, while γ-H2A.X increases (Fig. 1b). As it has been 
reported that H2A.X is highly enriched in Xenopus oocytes and 
eggs, we addressed whether its Tyr 142 phosphorylation status 
would be altered in response to DNA damage treatment, in the context 
of Xenopus early embryonic development. Similar to mammalian 
cells, Tyr 142 phosphorylation levels are greatly decreased in response 
to DNA damage (Supplementary Fig. 1a). 

Because the major form of H2A in yeast carries the ‘SQEL’ motif 
(Fig. 1a and ref. 3), we examined whether a point mutant mimicking 
the mammalian ‘SQEY’ motif would constitute a new phospho- 
acceptor. Similar to the mammalian cells, this mutant yeast strain 
(Leu132Tyr) is strongly reactive to anti-H2A.X Tyr 142(ph) antibodies 
with reactivity greatly diminished after DNA damage treatment 
(Supplementary Fig. 1b). Collectively, these results raise the intriguing 
possibility that an evolutionarily conserved DNA-damage-regulated 
phosphotyrosine pathway exists and regulates H2A.X and probably 
other DNA damage responsive proteins. 


The WSTF-SNF2H complex interacts with H2A.X 


To investigate the mechanisms that regulate H2A.X function, we 
sought to isolate protein complexes directly associated with H2A.X 
in vivo. Primary H2A.X−/− MEF cells were reconstituted with 
H2A.X constructs (wild type or Tyr142Phe mutant), with or without 
an N-terminal Flag epitope tag. The Flag tag did not interfere with 
typical DNA damage response pathways, including γ-H2A.X (data not 
shown). Three independent cell lines (wild type and mutants) were 
constructed in which the expression levels were comparable (50-80%) 
to the control primary MEFs (with two intact endogenous H2A.X 
alleles) and most of the reconstituted cells (~90%) have similar 
expression levels (data not shown). To enrich for proteins or com- 
plexes that may associate with H2A.X mononucleosomes, we adapted 
an approach that successfully enriched such chromatin particles for 
subsequent biochemical purifications”. This method, followed by 
immunoprecipitation with anti-Flag antibodies, greatly enriched 
H2A.X-containing mononucleosomes and associated proteins 
(Fig. 2a and Supplementary Fig. 2). As shown by silver staining, a small 
number of polypeptides besides histones were associated with the wild- 
type H2A.X mononucleosomes at a stoichiometric level, the amount of 
which was decreased after cells were treated with ionizing radiation 
(Fig. 2b). In addition, these interactions are reduced with Tyr142Phe 
mutant H2A.X mononucleosomes (Fig. 2b). Subsequent mass spectrometry 
analysis demonstrated that two of the more prominent 
proteins are SNF2H (also known as SMARCA5; 140 kDa), a mammalian 
homologue of the ISWI ATPase, and WSTF (171 kDa), a 
member of the BAZ/WAL family chromatin remodelling factors 
(Supplementary Fig. 3). These proteins constitute the WICH complex 
(WSTF-ISWI ATP-dependent chromatin-remodelling complex), 
which mobilizes nucleosomes in vitro and is suggested to be involved in 
the regulation of DNA replication”. 

We also investigated if other protein factors present in ISWI-con- 
taining chromatin remodelling complexes, such as CHRAC15 (also 
known as CHRAC1) and CHRAC17 (POLE3) (ref. 30), co-purified 
with H2A.X-containing mononucleosomes at sub-stoichiometric 
levels. However, none of these components were detected by 
either mass spectrometry or immunoblotting approaches (data not 
shown), leading us to the tentative conclusion that only the WICH 
complex copurifies with H2A.X-containing mononucleosomes at 
Figure 2 | The WSTF-SNF2H chromatin remodelling complex is 
specifically associated with H2A.X nucleosomes in vivo. a, Purification 
scheme of H2A.X-containing mononucleosomes. MEFs reconstituted with 
Flag—H2A.X (wild type or Tyr142Phe) were treated with or without ionizing 
radiation (IR). b, Two polypeptides migrating at 145 and 171 kDa were 
associated with undamaged wild type (WT, left) but not the Tyr142Phe 
mutant (right) H2A.X-mononucleosomes in a silver-stained gel. Mass 
spectrometry analyses showed that these polypeptides were WSTF (171 kDa) 
and SNF2H (145 kDa). The third band at 50 kDa was identified by mass 
spectrometry as β-actin. Staining and mass spectrometry of the lower 
molecular weight bands (<20 kDa) showed similar levels of H2A.X and 
other core histones. c, The association between the WICH complex and 
undamaged wild-type H2A.X mononucleosomes was confirmed by 
immunoprecipitation (IP) and western blotting experiments. d, In WSTF 
RNAi cells, the expression level of WSTF was greatly diminished, and the 
H2A.X Tyr 142 phosphorylation level was greatly reduced. 
stoichiometric levels in vivo. We next confirmed these association 
results by immunoblot analyses of the complexes co-immunopreci- 
pitated with H2A.X-containing nucleosomes. Consistent with the 
mass spectrometry results, WSTF and SNF2H were enriched in these 
complexes (Fig. 2c). 

To investigate the roles of WSTF in regulating the function of 
H2A.X, several independent WSTF knockdown NIH3T3 cell lines 
(WSTF RNA interference (RNAi) cells) were generated by infection 
with a short hairpin WSTF RNAi construct, in which the WSTF 
protein expression is greatly reduced (Fig. 2d). Additionally, we 
attempted to knockdown SNF2H with a similar approach, but 
SNF2H knockdown resulted in S-phase defects and cell death (data 
not shown) as described in SNF2H-knockout mice studies*’; 
presumably, SNF2H functions in other chromatin remodelling com- 
plexes that are involved in a variety of vital cell functions”. The 
specificity of the RNAi knockdown approach was demonstrated by 
rescuing WSTF expression with the human WSTF gene (the 
sequences in the WSTF messenger RNA targeted by the RNAi con- 
struct are different between human and mouse; see later). Notably, 
WSTF deficiency leads to a decrease in Tyr 142 phosphorylation 
(Fig. 2d), indicating that WSTF may also be directly involved in 
regulating Tyr 142 phosphorylation in vivo (see below). These data 
suggest a regulatory mechanism of H2A.X that involves the WICH 
complex and Tyr 142 phosphorylation. 


WSTF is a tyrosine kinase that phosphorylates Tyr 142 of H2A.X 


To investigate further the function of WSTF, we purified recombin- 
ant, full-length human WSTF proteins in insect cells (Fig. 3a). A 
single protein band migrating at the expected molecular weight 
(171 kDa) was detected by silver staining (Fig. 3b). Mass spectro- 
metry analysis on the purified samples confirmed that this protein 
was WSTF, and did not detect any other proteins except trace 
amounts of heat shock proteins (data not shown). The recombinant 
full-length WSTF protein phosphorylated H2A.X-containing 
nucleosomes (Fig. 3b) that were reconstituted in vitro, using ATP 
and divalent manganese ion Mn²⁺ (but not Mg²⁺; see 
Supplementary Fig. 4) as cofactors. To address whether WSTF can 
specifically phosphorylate Tyr 142 of H2A.X, we also reconstituted 
Tyr142Phe mutant nucleosomes. The Tyr142Phe mutant does not 
affect histone octamer or nucleosome formation during reconstitution 
(data not shown). In in vitro kinase assays, WSTF phosphorylated 
the wild-type but not Tyr142Phe H2A.X, indicating that 
Tyr 142 is the main site of phosphorylation (Fig. 3b). The in vitro 
kinase activity of WSTF towards Tyr 142 of H2A.X was confirmed by 
immunoblotting using anti-H2A.X Tyr 142(ph) antibodies (Fig. 3c). 

Because no detectable proteins other than trace amounts of heat 
shock proteins co-purified with WSTF from insect cells (Fig. 3b), we 
proposed that WSTF has an intrinsic kinase activity. However, WSTF 
does not share homology with known kinase domains. Therefore we 
generated truncated recombinant WSTF proteins (Fig. 3a) to invest- 
igate which portion of the protein is required for this kinase activity. 
Using this approach, we demonstrated that the N-terminal portion of 
WSTF is necessary and sufficient for kinase activity, strongly indicating 
that the kinase motif resides in its N-terminal portion (Fig. 3d). 
Additional structure function analyses determined that residues 
1-345 contained the active kinase domain composed of a WAC 
domain and a newly identified C-motif that are both required for 
optimal kinase activity (Fig. 3e, g and Methods). A Cys338Ala muta- 
tion generated a kinase dead allele that maintained proper folding 
(Fig. 3f, Supplementary Fig. 6 and Methods). To confirm the intrinsic 
activity of the putative WSTF kinase domain further, we generated 
recombinant WSTF proteins in Escherichia coli, because protein 
kinases, especially tyrosine kinases, are rare in the E. coli genome. 
Bioinformatic and biochemical analysis of the kinase domain of 
WSTF reveals two well-structured motifs (N-motif, containing the 
conserved WAC domain, and a previously unreported C-motif; 
Fig. 3a and Supplementary Fig. 7). Co-expression of both motifs 
Figure 3 | WSTF contains a kinase domain that phosphorylates Tyr 142 of 
H2A.X. a, Schematic demonstration of the domain architecture of the 
human WSTF protein and a series of recombinant proteins representing 
portions of WSTF. Bromo, bromo domain; FL, full-length; GST, glutathione 
S-transferase; PHD, PHD (plant homeodomain) finger domain; SP, SP 
sepharose chromatography. b-f, Recombinant WSTF proteins were 
generated in insect cells; the N- and C-motifs (g) were generated in E. coli. 
b, Tyr 142 in the nucleosome is phosphorylated by recombinant full-length 
WSTF protein. WT, wild type. c, The specific phosphorylation of H2A.X at 
Tyr 142 was detected by immunoblotting with anti-H2A.X Tyr 142(ph) 
antibodies. In d—g, free H2A.X proteins were used as substrates in in vitro 
kinase assays. d, N-WSTF, but not C-WSTF, had kinase activity. e, The 
WSTF 1-340 construct had much reduced kinase activity (<50-fold) in 
comparison to the other constructs. f, The Cys338Ala point mutant (derived 
from wild-type 1-359) had much reduced kinase activity. This mutation 
does not change the global folding properties of the kinase domain 
(Supplementary Fig. 6). g, The co-expressed N-motif and C-motif had 
enhanced kinase activity towards H2A.X, whereas the N-motif alone had 
minimal kinase activity. The C-motif construct, when expressed alone, was 
unstable and partially degraded from its C terminus; however, it was 
protected when co-expressed with the N-motif. 


restores the kinase activity, whereas the N-terminal motif alone 
had a much reduced activity (Fig. 3g). These results strongly indicate 
that the N- and C-motif are required for the optimal kinase activity. 
Collectively, our results demonstrate that WSTF has a previously 
unidentified intrinsic kinase activity by means of a novel kinase 
domain. 
WSTF has a critical role in foci formation 

Because the functions of the WICH complex, especially that of WSTF, 
in the DNA damage response have not been explored, we investigated 
the role of WSTF in the DNA damage response. On DNA damage 
treatment, the initial phosphorylation of H2A.X (γ-H2A.X) and its foci 
formation in WSTF RNAi cells (as described in Fig. 2) were similar to 
control cells (up to 1 h after ionizing radiation, Fig. 4a). However, in 
control cells, the γ-H2A.X level remained unchanged until 16 h after 
ionizing radiation. In contrast, the level of γ-H2A.X phosphorylation 
rapidly declined in WSTF RNAi cells 4 h after ionizing radiation 
treatment (Fig. 4a). It is well-established that the number and 
morphology of γ-H2A.X foci undergo notable changes during the 
DNA damage response. Initially, a larger number of small foci are 
formed, whereas at late stages of the DNA damage response, only a 
few large foci are usually observed. As expected, large γ-H2A.X foci 
were formed in control cells starting 4 h after ionizing radiation 
(Fig. 4b, top panels) and persisting until 12 h, whereas the overall level 
of γ-H2A.X phosphorylation remains relatively constant. However, 
this morphological progression was not observed in WSTF RNAi cells 
despite the initial formation of small γ-H2A.X speckles as in the controls 
(Fig. 4b, bottom panels). Instead, the amount and intensity of the 
γ-H2A.X foci were much less than in controls (only 16% of the WSTF 
Figure 4 | WSTF is critical for the maintenance of γ-H2A.X 
phosphorylation after DNA damage. a, γ-H2A.X maintenance is defective 
in WSTF RNAi cells after DNA damage treatment. The time points labelled 
indicate the recovery time after the ionizing radiation treatment. 
b, Immunofluorescent staining experiments were performed on control and 
WSTF RNAi cells fixed at different time points (as labelled) following 10 Gy 
of ionizing radiation treatment. c, d, Mdc1 and phos-ATM recruitment are 
also defective in WSTF-deficient cells. Immunofluorescent staining 
experiments were performed with anti-ATM Ser 1981(ph) and anti-Mdc1 
antibodies on control or WSTF RNAi cells, 8 h after 10 Gy of ionizing 
radiation treatment. DAPI, 4,6-diamidino-2-phenylindole. e, WSTF RNAi 
cells were complemented with the wild-type (WT) or mutant (Cys338Ala) 
kinase domain constructs (1-359, Myc-epitope-tagged at the N terminus) of 
WSTF. Eight hours after treatment with 10 Gy of ionizing radiation, cells were 
fixed and co-stained with γ-H2A.X (green) and anti-Myc antibodies (red). 
f, The kinase activity of WSTF (red) is also critical for phos-ATM foci 
maintenance (green). Original magnification b-f, ×60. 
RNAi retain γ-H2A.X foci 4 h after ionizing radiation treatment), and 
no large foci were observed in WSTF RNAi cells during the entire 
experiment, even 12 h after ionizing radiation (Fig. 4b). Because the 
major kinases for γ-H2A.X phosphorylation are ATM and ATR, and 
γ-H2A.X foci maintenance is dependent on sustained recruitment of 
active ATM to the damage foci via Mdc1 (ref. 17), we addressed 
whether the relocalization of ATM and Mdc1 were defective in 
WSTF-deficient cells. In control cells, large Mdc1 foci were observed 
at a late stage of the DNA damage response (8 h after 10 Gy of ionizing 
radiation; Fig. 4c), whereas similar foci were not observed in the 
WSTF-deficient cells, indicating that the recruitment of Mdc1 is 
defective at a late stage of the DNA damage response in these cells. 
Autophosphorylation at Ser 1981 (referred to as phos-ATM hereafter) 
is one of the indicators of ATM activation during the DNA damage 
response. Foci formation of phos-ATM (and potentially other 
targets of ATM) was also impaired in WSTF-deficient cells (Fig. 4d). 
These data strongly suggest that WSTF has a critical role in the recruitment 
of active ATM and Mdc1 to the damage sites, both of which are 
critical for γ-H2A.X foci formation. 

We next investigated whether the kinase activity of WSTF is 
required for its function during the DNA damage response, namely 
the maintenance of phos-ATM and γ-H2A.X foci. Because the WSTF 
mRNA sequences targeted by the short hairpin RNAi constructs are 
different between human and mouse (see earlier), we complemented 
WSTF RNAi cells (derived from mouse 3T3 cells) with wild-type or 
Cys338Ala mutant kinase domain constructs of human WSTF. This 
approach successfully restored the expression of the kinase domain of 
WSTF (Fig. 4e, f). The expression of the wild-type WSTF kinase 
domain rescued the γ-H2A.X and phos-ATM foci formation defects 
in the WSTF RNAi cells (Fig. 4e, f). In nearly 100% of the cells 
complemented with the wild-type construct, γ-H2A.X and phos-ATM 
foci were observed at least 8 h after DNA damage treatment 
(Fig. 4e, f). Conversely, the phos-ATM and γ-H2A.X foci formation 
were as defective in cells expressing the Cys338Ala mutant as in 
WSTF RNAi cells containing vector alone (Fig. 4e, f). Taken together, 
these results demonstrate that WSTF has an important involvement 
in the DNA damage response by means of its kinase activity, an 
activity that targets H2A.X Tyr 142 and possibly other yet unknown 
substrates. 

We wondered, however, whether Ser 139 phosphorylation was 
required for the steady-state balance of H2A.X phosphorylation at 
Tyr 142, or vice versa. As previously reported, the maintenance of 
γ-H2A.X phosphorylation is affected in the Mdc1−/− MEF cells 
(Supplementary Fig. 8). Tyr 142, however, becomes gradually dephosphorylated 
in the Mdc1−/− cells as in the wild-type control cells 
(Supplementary Fig. 8), indicating that sustained γ-H2A.X phosphorylation 
is not required for Tyr 142 dephosphorylation. 
Conversely, the Ser 139 phosphorylation level and foci formation 
were greatly reduced after DNA damage in both Tyr142Leu and 
Tyr142Phe mutant cells (Supplementary Fig. 8). In addition, mutations 
on Tyr 142 affect Mdc1 binding to the H2A.X C-tail phosphorylated 
at Ser 139 (Supplementary Fig. 8). These data suggest that these 
phosphorylation events may be coordinated during the DNA damage 
response (see later in the text). 


Discussion 


Our studies highlight a new regulatory mechanism controlling histone 
H2A.X function mediated by the WSTF–SNF2H (WICH) chromatin 
remodelling complex. We found a previously unknown 
phosphorylation site on H2A.X, Tyr 142, and demonstrated that it 
has a vital role in the DNA damage response. We also discovered that 
the N-terminal domain of WSTF has a previously unrecognized tyrosine 
kinase activity for this site (see below). WSTF, a gene frequently 
deleted in human Williams–Beuren (also known as Williams) syndrome, 
has previously been shown to be the regulatory component of 
the WICH chromatin remodelling complex. Our findings have 
therefore identified an unexpected link between a histone minor 
variant, H2A.X, a histone covalent modification and ATP-dependent 
chromatin remodelling mechanisms that control the mammalian 
DNA damage response. 

The kinase domain of WSTF is composed of two putative motifs. 
The N-motif contains the highly conserved WAC domain, which is 
found in many eukaryotic proteins, whereas the C-motif is much 
more divergent. Besides H2A.X, WSTF may have other downstream 
targets given its multiple roles in DNA damage repair, replication and 
transcription. Williams–Beuren syndrome is a well-documented 
genetic disorder characterized by developmental defects and clinical 
behavioural phenotypes, so identifying the link between the biochemical 
function of WSTF and the clinical manifestation of 
Williams–Beuren syndrome remains a challenge for future studies. 
In addition, WSTF kinase activity and Tyr 142 phosphorylation may 
have several roles during DNA damage repair (see Supplementary 
Information for a detailed discussion). First, pre-existing 
phosphorylated Tyr 142 may be needed at the break to adjust local 
chromatin structure for the later maintenance of Ser 139 phosphorylation. 
Subsequent dephosphorylation of Tyr 142 after DNA 
damage could act to enhance Mdc1 and ATM recruitment to extend 
and maintain γ-H2A.X phosphorylation. Second, WSTF may regulate 
γ-H2A.X foci maintenance through other pathways and unidentified 
targets, which may be involved in recruiting or stabilizing Mdc1 
and ATM at damage foci. Finally, in the absence of Tyr 142 phosphorylation, 
the kinetics of the phosphorylation/dephosphorylation 
cycle of γ-H2A.X may be altered such that both the phosphorylation 
and dephosphorylation of H2A.X on Ser 139 occur much more 
rapidly, and the foci form and disassemble in less than 4 h. 
Identification of the signalling pathways using H2A.X Tyr 142 phosphorylation 
or other physiologically relevant phosphorylation events 
regulated by the WICH chromatin remodelling complex will be of 
interest in future studies. 


METHODS SUMMARY 

Plasmids and cell culture. All plasmid constructs were generated with standard 
protocols (see Methods). Recombinant murine stem cell viruses (MSCV) carrying
H2A.X or WSTF constructs were packaged in Phoenix cells and used to infect
MEFs or NIH3T3 cells. MSCV production and infection followed standard
protocols. For each construct, three independent lines were derived. The
expression levels were checked by immunoblots and immunofluorescence.
Purification of H2A.X-containing mononucleosomes and associated protein
factors. MEF fractionation and chromatin pellet isolation were performed as
described. Chromatin pellets were briefly digested with MNase (Sigma), the
generation of mononucleosomes was monitored by electrophoresis, and the
samples were subjected to mass spectrometry analysis.

Generation of recombinant WSTF proteins. Recombinant baculovirus car- 
rying the full-length WSTF (Flag-epitope-tagged at C terminus) was produced 
using the BaculoGold-BEVS kit (BD Biosciences). The full-length WSTF 
proteins were purified using a published protocol. The viruses that express
truncated WSTF constructs were produced with the Bac-to-Bac (Invitrogen) 
approach and purified (Methods). The N- and C-motif of the WSTF kinase 
domain were generated in E. coli (Methods). 

In vitro kinase assay conditions. Recombinant full-length (50-100 ng) or truncated
WSTF proteins (0.1-0.5 µg) were incubated with free histone proteins
(1-2 µg), histone octamers or nucleosomes assembled in vitro (0.5-1 µg) in
20 mM Tris, pH 7.4, and 150 mM NaCl supplemented with γ-³²P-ATP (NEN)
and 1 mM MnCl₂ at 25 °C for 45 min.


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS

Plasmids and cell culture. Tyr 142 mutant constructs were derived from wild-type
human H2A.X plasmids (Openbiosystems) using the QuickChange kit
(Stratagene) and verified by DNA sequencing. The primers were: Tyr142Leu
forward, 5′-CAAGAAGGCCACCCAGGCCTCCCAGGAGCTCTAAG-3′; Tyr142Leu
reverse, 5′-CTTAGAGCTCCTGGGAGGCCTGGGTGGCCTTCTTG-3′; Tyr142Phe
forward, 5′-CAAGAAGGCCACCCAGGCCTCCCAGGAGTTCTAAG-3′; Tyr142Phe
reverse, 5′-CTTAGAACTCCTGGGAGGCCTGGGTGGCCTTCTTG-3′. These
plasmids were amplified by PCR and cloned into pTOPO vectors (Invitrogen).
The primers were: wild-type forward, 5′-CACCTCGGGCCGCGGCAAGACTG-3′;
wild-type reverse, 5′-CTTAGTACTCCTGGGAG-3′; Tyr142Leu,
5′-CTTAGAGCTCCTGGGAG-3′; Tyr142Phe, 5′-CTTAGAACTCCTGGGAG-3′.
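As a quick consistency check on the mutagenesis primers listed above, the following minimal sketch (not part of the published protocol) tests whether each reverse primer is the reverse complement of its forward partner and reads off the codon placed immediately before the TAA stop codon. The function names and the three-entry codon table are illustrative assumptions; the pair grouped here under Tyr142Phe is the one whose forward primer carries the TTC codon.

```python
# Sanity check for site-directed mutagenesis primer pairs (illustrative only).
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]

def codon_before_stop(forward: str, stop: str = "TAA") -> str:
    """Return the three bases immediately upstream of the last stop codon."""
    pos = forward.rfind(stop)
    return forward[pos - 3:pos]

# Forward and reverse primers copied from the Methods text.
PRIMERS = {
    "Tyr142Leu": ("CAAGAAGGCCACCCAGGCCTCCCAGGAGCTCTAAG",
                  "CTTAGAGCTCCTGGGAGGCCTGGGTGGCCTTCTTG"),
    "Tyr142Phe": ("CAAGAAGGCCACCCAGGCCTCCCAGGAGTTCTAAG",
                  "CTTAGAACTCCTGGGAGGCCTGGGTGGCCTTCTTG"),
}

# Only the codons needed here (standard genetic code).
CODONS = {"TAC": "Tyr", "CTC": "Leu", "TTC": "Phe"}

for name, (fwd, rev) in PRIMERS.items():
    paired = reverse_complement(fwd) == rev
    residue = CODONS[codon_before_stop(fwd)]
    print(name, "| complementary pair:", paired, "| codon 142 encodes", residue)
```

The same check can be applied to the short pTOPO cloning primers, whose reverse complements end in the TAC, CTC or TTC codon followed by TAA.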

To generate N-terminal Flag-tagged constructs, the wild-type or mutant
H2A.X coding regions in the pTOPO vectors were cloned in-frame between the
BamHI and XhoI sites of the pCMV-Tag2A vectors (Stratagene). Tagged or
untagged constructs were cloned into MSCV-puro vectors (gifts from
W. Herr’s laboratory). To generate constructs for recombinant protein expression
in E. coli, the coding regions of the wild-type or Tyr142Phe H2A.X of the
untagged vectors (above) were cloned into pET28 vectors (Novagen), in-frame
with the N-terminal 6× His tag.

MSCV virus production and infection followed standard protocols. In brief,
MSCV viruses were packaged in Phoenix cells and used to infect H2A.X−/−
MEFs (gifts from A. Nussenzweig’s laboratory), followed by puromycin selection.
For each construct, three independent lines were derived. The expression
levels of H2A.X were checked by immunoblots and immunofluorescence. 

The full-length human WSTF construct was a gift from P. Varga-Weisz. To
generate a series of truncated WSTF constructs for insect cell expression, the PCR 
fragments of the corresponding regions of the WSTF gene were cloned into a 
modified pFASTBac vector (Invitrogen), which contains an N-terminal GST and 
a C-terminal His-tag sequence. The Cys338Ala mutant (1-359) was derived from 
the wild-type 1-359 construct, using the QuickChange site-directed mutagenesis 
kit (Stratagene) and was verified by DNA sequencing. To generate constructs for 
protein expression in E. coli, the PCR product of the N-motif (1-205) was cloned 
into pGex6p vector (GE Healthcare) in-frame with the N-terminal GST tag, and 
the PCR product of the C-motif (208-345) was cloned into a modified 
pRSFDuet-1 vector (Novagen) in-frame with the N-terminal MBP (maltose
binding protein) tag. To generate constructs for mammalian cell expression,
the PCR products (the forward and reverse primers contain a BamHI and
an XhoI site sequence, respectively) of the corresponding fragments of the human
WSTF gene were cloned between the BamHI and XhoI sites of pTAG3A
vectors (Stratagene), in-frame with the N-terminal Myc epitope tag. The tagged 
constructs were cloned into MSCV-puro vectors. 

Generation of WSTF RNAi knockdown cells. WSTF RNAi cell lines were gen- 
erated using recombinant pShag-2 vectors containing short hairpin RNA 
(shRNA) constructs targeting the mouse Wstf gene (Clone ID: V2MM_17087, 
Openbiosystems; hairpin sequence: GCTGTTGACAGTGAGCGCGGAGATAC- 
TTCGATACTTTATTAGTGAAGCCACAGATGTAATAAAGTATCGAAGTA- 
TCTCCTTGCCTACTGCCTCGGA). Three independent lines were generated by 
infecting virus carrying these vectors into NIH3T3 cells and propagated under 
puromycin. In Fig. 4, the WSTF RNAi cells were infected with recombinant MSCV
viruses carrying the wild-type (1-359) or the mutant (Cys338Ala) constructs. 

Purification of H2A.X-containing mononucleosomes and associated protein 
factors. MEF fractionation and chromatin pellet isolation (Fig. 2) were performed 
as described. Chromatin pellets were briefly digested with MNase (Sigma) and
the generation of mononucleosomes was monitored by electrophoresis. Fifty
microlitres of Flag-M2 agarose beads were used for immunoprecipitating 
the total mononucleosomes isolated from 2 X 10° cells. The precipitated complex 
was resolved on 4—12% gradient gels (Invitrogen) and then silver or Coomassie- 
blue stained. The gel bands were isolated and subjected to mass spectrometry. 

Protein identification by mass spectrometry. Gel-resolved proteins were
digested with trypsin, batch purified on a reversed-phase micro-tip, and the result- 
ing peptide pools individually analysed by matrix-assisted laser desorption/ 
ionization reflectron time-of-flight (MALDI-reTOF) mass spectrometry 
(UltraFlex TOF/TOF; Bruker Daltonics) for peptide mass fingerprinting (PMF), 
as described. Selected peptide ions (m/z) were taken to search a
‘non-redundant’ protein database (‘NR’; 3,245,378 entries on 28 January 2006;
National Center for Biotechnology Information) using the PeptideSearch algo- 
rithm (M. Mann, Max-Planck Institute for Biochemistry; an updated version of 
this program is at present available as ‘PepSea’ from Applied Biosystems/MDS 
Sciex). A molecular mass range up to twice the apparent molecular weight (as 
estimated from electrophoretic relative mobility) was covered, with a mass accu- 
racy restriction of less than 40 p.p.m., and a maximum of one missed cleavage site 
allowed per peptide. To confirm the PMF results, mass spectrometric sequencing 
of selected peptides was performed by MALDI-TOF/TOF (MS/MS) analysis on the
same prepared samples, using the UltraFlex instrument in ‘LIFT’ mode. Fragment
ion spectra were taken to search the NR database using the MASCOT MS/MS
Ion Search program, version 2.0.04 for Windows (Matrix Science Ltd.). Any
tentative confirmation of a PMF result thus obtained was verified by comparing 
the computer-generated fragment ion series of the predicted tryptic peptide with 
the experimental MS/MS data. 
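The two search constraints quoted above (a mass accuracy better than 40 p.p.m. and at most one missed cleavage per peptide) can be illustrated with a minimal sketch. This is not the PeptideSearch or MASCOT implementation; the function names are hypothetical, and the cleavage rule used (cut after Lys or Arg unless the next residue is Pro) is the common textbook approximation of trypsin specificity, assumed here for illustration.

```python
import re

def tryptic_peptides(sequence: str, max_missed: int = 1) -> list[str]:
    """In-silico tryptic digest allowing up to `max_missed` missed cleavages."""
    # Zero-width split after K or R, except when the next residue is P.
    fragments = [f for f in re.split(r"(?<=[KR])(?!P)", sequence) if f]
    peptides = []
    for start in range(len(fragments)):
        last = min(start + max_missed, len(fragments) - 1)
        for end in range(start, last + 1):
            # Joining adjacent fragments models skipped cleavage sites.
            peptides.append("".join(fragments[start:end + 1]))
    return peptides

def within_ppm(observed: float, theoretical: float, tol_ppm: float = 40.0) -> bool:
    """True if the observed m/z lies within `tol_ppm` of the theoretical value."""
    return abs(observed - theoretical) / theoretical * 1e6 <= tol_ppm

if __name__ == "__main__":
    # Toy sequence, chosen only to exercise the cleavage rule.
    print(tryptic_peptides("AKRPGKC"))
    print(within_ppm(1000.03, 1000.0), within_ppm(1000.05, 1000.0))
```

A real fingerprinting search would additionally compute the theoretical mass of every candidate peptide and score the number of matched peaks per database entry.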

Immunoblotting and immunofluorescence. Cells at 70-80% confluence were 
exposed to an ionizing radiation source (Cs) to introduce DNA damage (at 
10 Gy). Samples were collected at the indicated time points after irradiation. 
Histone extraction, cell fractionation, high salt nuclear extraction and western 
blotting followed standard protocols. For immunofluorescence, cells were
grown on coverslips and fixed with 3% paraformaldehyde. The subsequent
indirect immunofluorescent staining followed standard protocols. For WSTF
staining, cells were extracted with a high-salt buffer for 2 min at 4 °C before
fixing, as described. The antibodies used were: γ-H2A.X, anti-general H2A.X
and anti-BRCA1 (Millipore/Upstate); anti-WSTF and anti-Myc (Sigma); anti-
ATM Ser 1981(ph) (Rockland immunological); anti-Mdc1, anti-BAF53 and
anti-SNF2H were gifts from Z. Lou, J. Kim and R. Roeder, and D. Reinberg,
respectively.

Recombinant Mdc1 protein expression and peptide pull-down assay.
GST-Mdc1-BRCT protein expression followed the manufacturer’s protocols (Stratagene).
Peptides (UN (unmodified): N-CPSGGKKATQASQEY-C; Ser 139(ph):
N-CPSGGKKATQApSQEY-C; Tyr 142(ph): N-CPSGGKKATQASQEpY-C; Ser 139(ph)
and Tyr 142(ph): N-CPSGGKKATQApSQEpY-C; Phe 142: N-CPSGGKKATQASQEF-C;
Leu 142: N-CPSGGKKATQASQEL-C; Ser 139(ph), Leu 142: N-CPSGGKKATQApSQEL-C;
Ser 139(ph), Phe 142: N-CPSGGKKATQApSQEF-C) were conjugated to agarose
beads with the Sulfolink kit (Pierce). Peptide pull-down experiments were performed
as described previously.

Circular dichroism measurement. Circular dichroism (CD) experiments were 
performed on an Aviv 62A/DS (Aviv Associates) spectropolarimeter. Spectra 
(260 nm to 200 nm) were collected with a cell path length of 0.1 cm and a
bandwidth of 1 nm. Samples were diluted to 10 µM with 20 mM Tris-HCl,
pH 7.5, 100 mM KCl, 2 mM dithiothreitol. Protein concentrations were determined
from their absorbance at 280 nm.

Automated chemical (‘Edman’) protein sequencing. Protein samples were 
analysed with a Procise 494 instrument from Applied Biosystems as described.
Stepwise liberated phenylthiohydantoin (PTH)-amino acids were identified
using an ‘online’ HPLC system (Applied Biosystems) equipped with a PTH
C18 (2.1 × 220 mm; 5 µm particle size) column (Applied Biosystems).

Mapping WSTF kinase domain and preparation of recombinant WSTF
protein from insect cells. To map the putative kinase domain, we generated a 
series of protein constructs within the N-terminal regions of WSTF. All these 
protein constructs were extensively purified (Fig. 3a, right), and mass spectro- 
metry analysis did not identify any contaminating protein kinases. The 1-345 
construct had more potent kinase activity (>50-fold) than the 1-340 construct 
(Fig. 3e). Furthermore, the substrate specificity of the 1-345 construct is similar 
to the full-length WSTF (Supplementary Fig. 5). These results indicate that the 
amino acid residues in the surrounding regions of 340-345 are critical for the 
kinase activity of WSTF. Consistent with these results, the activity of a point 
mutant at Cys 338 (Cys338Ala) was greatly diminished (<50-fold, Fig. 3f). The 
CD spectrum of this mutant is identical to the wild-type counterpart 
(Supplementary Fig. 6), indicating that this mutation did not alter the global 
structure of the protein. 

Recombinant baculovirus carrying the full-length WSTF (Flag-epitope- 
tagged at C terminus) was produced using the BaculoGold-BEVS kit (BD 
Biosciences). The full-length WSTF proteins were purified using a published 
protocol”. The viruses that express truncated WSTF constructs were produced 
using the Bac-to-Bac (Invitrogen) method, all of which were tagged with an 
N-terminal GST tag and a C-terminal 6× His tag. Baculoviruses were produced
in Sf9 cells, and proteins were expressed in Hi5 suspension cells. For truncated 
WSTF protein purification, collected Hi5 cells were suspended in 0.4M KCl, 
20mM Tris, pH 7.5, supplemented with the EDTA-free Complete Protease 
Inhibitor Cocktail tablet (Roche). Whole cell lysates were purified through 
GSTrap columns (GE Healthcare). After removal of the GST tags by 
PreScission Protease digestion (GE Healthcare), the protein mixtures were puri- 
fied by HiTrap SP (GE Healthcare). The recombinant WSTF proteins (peaks 
form at ~200 mM KCl, 20 mM Tris, pH 7.5) were further purified with Superdex 
75 (GE Healthcare) in the elution buffer (50 mM KCl, 20 mM Tris, pH 7.5). The 
monomeric peaks (~60 kDa) were collected and concentrated for further use. 
The Cys338Ala mutant protein was expressed and purified as the wild-type 
protein described above. 

Protein preparation in E. coli. The recombinant plasmid vectors carrying the N- 
and C-motif constructs (described earlier) were transformed into E. coli host 
strain Rosetta2 (DE3) (Novagen) for protein expression under triple antibiotic
selection (ampicillin, kanamycin and chloramphenicol) in LB medium. After
overnight induction by 0.4 mM isopropyl β-D-thiogalactoside at 25 °C, cells were
collected in buffer (0.4 M KCl, 20 mM Tris, pH 7.5). Co-expressed proteins were 
affinity purified using Amylose resin (New England Biolabs) against the MBP 
tag. Maltose-eluted samples were purified through a Superdex 200 column (GE 
Healthcare) in the elution buffer (50 mM KCl, 20 mM Tris, pH 7.5). Most
proteins were eluted in a single peak that contains GST-tagged WAC N-motif 
and MBP-tagged WAC C-motif with a stoichiometry of 1:1. 

The N- and C-motifs were also expressed separately in Rosetta2 (DE3)
(Novagen) cells in parallel. The N-motif was purified with a GSTrap column
and Superdex 200 and the C-motif was purified with Amylose resin and
Superdex 200. Most proteins were eluted in a monomeric peak (~50 kDa).

Histone protein purification and in vitro assembly of histone octamers and
mononucleosomes. Individual free histone proteins (H2A.X, H2B, H3 and H4)
were expressed and purified using a standard protocol. N-terminal 6× His-
epitope-tagged wild-type and Phe 142 H2A.X proteins were purified with Ni
Sepharose6 columns (GE Healthcare). The assembly of the histone octamers 
followed standard protocols. Four recombinant histone proteins were denatured
in 8 M guanidine-HCl solutions and dialysed to 2 M NaCl. The crude
histone octamers were further purified through a Superdex 75 column (GE
Healthcare). Mononucleosomes were assembled as described. The PCR products
of the DNA template (nucleosome positioning sequence (clone 601)) were
initially incubated in a reaction buffer supplemented with 2 M NaCl and
diluted stepwise in the same reaction buffer until the NaCl concentration
reached 200 mM. The assembly of mononucleosomes was checked by electrophoresis
on 5% native PAGE gels.

In vitro kinase assays. Recombinant full-length (50-100 ng) or truncated WSTF
proteins (0.1-0.5 µg) were incubated with free histone proteins (1-2 µg), histone
octamers or nucleosomes assembled in vitro (0.5-1 µg) in 20 mM Tris, pH 7.4
and 150 mM NaCl supplemented with γ-³²P-ATP (NEN) and 1 mM MnCl₂ for
45 min. In Supplementary Fig. 4, 0.1-1 mM of MnCl₂ or MgCl₂ was supplemented
as indicated. The reaction was stopped by the addition of 2 mM EDTA. The
reaction mixtures were separated by SDS-PAGE, stained with Coomassie blue
and dried. The dried gels were exposed against phosphorimager screens
(FujiFilm). The autoradiography images were scanned in a phosphorimager
(FujiFilm) and analysed by Image Reader FLA-5000.

A role for self-gravity at multiple length scales in the process of star formation

Alyssa A. Goodman, Erik W. Rosolowsky, Michelle A. Borkin, Jonathan B. Foster, Michael Halle, Jens Kauffmann & Jaime E. Pineda


Self-gravity plays a decisive role in the final stages of star forma- 
tion, where dense cores (size ~0.1 parsecs) inside molecular clouds 
collapse to form star-plus-disk systems. But self-gravity’s role at
earlier times (and on larger length scales, such as ~1 parsec) is 
unclear; some molecular cloud simulations that do not include 
self-gravity suggest that ‘turbulent fragmentation’ alone is suf- 
ficient to create a mass distribution of dense cores that resembles, 
and sets, the stellar initial mass function. Here we report a ‘dendrogram’
(hierarchical tree-diagram) analysis that reveals that
self-gravity plays a significant role over the full range of possible
scales traced by ¹³CO observations in the L1448 molecular cloud,
but not everywhere in the observed region. In particular, more 
than 90 per cent of the compact ‘pre-stellar cores’ traced by peaks 
of dust emission are projected on the sky within one of the dendrogram’s
self-gravitating ‘leaves’. As these peaks mark the locations
of already-forming stars, or of those probably about to form,
a self-gravitating cocoon seems a critical condition for their exist- 
ence. Turbulent fragmentation simulations without self-gravity— 
even of unmagnetized isothermal material—can yield mass and 
velocity power spectra very similar to what is observed in clouds 
like L1448. But a dendrogram of such a simulation shows that
nearly all the gas in it (much more than in the observations) 
appears to be self-gravitating. A potentially significant role for 
gravity in ‘non-self-gravitating’ simulations suggests inconsistency 
in simulation assumptions and output, and that it is necessary to 
include self-gravity in any realistic simulation of the star-formation 
process on subparsec scales. 

Spectral-line mapping shows whole molecular clouds (typically 
tens to hundreds of parsecs across, and surrounded by atomic gas) 
to be marginally self-gravitating. When attempts are made to further
break down clouds into pieces using ‘segmentation’ routines, some 
self-gravitating structures are always found on whatever scale is 
sampled. But no observational study to date has successfully used
one spectral-line data cube to study how the role of self-gravity varies 
as a function of scale and conditions, within an individual region. 

Most past structure identification in molecular clouds has been 
explicitly non-hierarchical, which makes difficult the quantification 
of physical conditions on multiple scales using a single data set. 
Consider, for example, the often-used algorithm CLUMPFIND. In
three-dimensional (3D) spectral-line data cubes, CLUMPFIND operates
as a watershed segmentation algorithm, identifying local maxima
in the position-position-velocity (p-p-v) cube and assigning nearby
emission to each local maximum. Figure 1 gives a two-dimensional
(2D) view of L1448, our sample star-forming region, and Fig. 2
includes a CLUMPFIND decomposition of it based on ¹³CO observations.
As with any algorithm that does not offer hierarchically nested or
overlapping features as an option, significant emission found between
prominent clumps is typically either appended to the nearest clump or
turned into a small, usually ‘pathological’, feature needed to encompass
all the emission being modelled. When applied to molecular-line


Figure 1 | Near-infrared image of the L1448 star-forming region with
contours of molecular emission overlaid. The channels of the colour image
correspond to the near-infrared bands J (blue), H (green) and K (red), and
the contours of integrated intensity are from ¹³CO(1-0) emission.
Integrated intensity is monotonically, but not quite linearly (see
Supplementary Information), related to column density, and it gives a view
of ‘all’ of the molecular gas along lines of sight, regardless of distance or
velocity. The region within the yellow box immediately surrounding the
protostars has been imaged more deeply in the near-infrared (using Calar
Alto) than the remainder of the box (2MASS data only), revealing protostars
as well as the scattered starlight known as ‘Cloudshine’ and outflows
(which appear orange in this colour scheme). The four billiard-ball labels
indicate regions containing self-gravitating dense gas, as identified by the
dendrogram analysis, and the leaves they identify are best shown in Fig. 2a.
Asterisks show the locations of the four most prominent embedded young
stars or compact stellar systems in the region (see Supplementary Table 1),
and yellow circles show the millimetre-dust emission peaks identified as
star-forming or ‘pre-stellar’ cores.
Figure 2 | Comparison of the ‘dendrogram’ and ‘CLUMPFIND’ feature-identification
algorithms as applied to ¹³CO emission from the L1448
region of Perseus. a, 3D visualization of the surfaces indicated by colours in 
the dendrogram shown in c. Purple illustrates the smallest scale self- 
gravitating structures in the region corresponding to the leaves of the 
dendrogram; pink shows the smallest surfaces that contain distinct self- 
gravitating leaves within them; and green corresponds to the surface in the 
data cube containing all the significant emission. Dendrogram branches 
corresponding to self-gravitating objects have been highlighted in yellow 
over the range of T_mb (main-beam temperature) test-level values for which
the virial parameter is less than 2. The x—y locations of the four ‘self- 
gravitating’ leaves labelled with billiard balls are the same as those shown in 
Fig. 1. The 3D visualizations show position-position-velocity (p-p-v) space.
RA, right ascension; dec., declination. For comparison with the ability of 
dendrograms (c) to track hierarchical structure, d shows a pseudo- 
dendrogram of the CLUMPFIND segmentation (b), with the same four 
labels used in Fig. 1 and in a. As ‘clumps’ are not allowed to belong to larger 
structures, each pseudo-branch in d is simply a series of lines connecting the 
maximum emission value in each clump to the threshold value. A very large 
number of clumps appears in b because of the sensitivity of CLUMPFIND to 
noise and small-scale structure in the data. In the online PDF version, the 3D 
cubes (a and b) can be rotated to any orientation, and surfaces can be turned 
on and off (interaction requires Adobe Acrobat version 7.0.8 or higher). In 
the printed version, the front face of each 3D cube (the ‘home’ view in the 
interactive online version) corresponds exactly to the patch of sky shown in 
Fig. 1, and velocity with respect to the Local Standard of Rest increases from 
front (−0.5 km s⁻¹) to back (8 km s⁻¹).


data, CLUMPFIND typically finds features on a limited range of scales, 
above but close to the physical resolution of the data, and its results can 
be overly dependent on input parameters. By tuning CLUMPFIND’s 
two free parameters, the same molecular-line data set can be used to
show either that the frequency distribution of clump mass is the same 
as the initial mass function of stars or that it follows the much shal- 
lower mass function associated with large-scale molecular clouds 
(Supplementary Fig. 1). 
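To make the contrast with hierarchical methods concrete, the sketch below shows a flat, watershed-style segmentation of a one-dimensional emission profile: every sample is assigned to exactly one local maximum, so no structure can contain another. It is a simplified stand-in written for illustration, not the CLUMPFIND algorithm itself (which works through contour levels in a 3D cube and has two tunable parameters); the function name and toy profile are assumptions.

```python
def watershed_1d(profile):
    """Assign each sample to the local maximum reached by steepest ascent.

    Returns a list in which entry i is the index of the peak that owns
    sample i. The labelling is flat: clumps never nest inside one another.
    """
    n = len(profile)

    def climb(i):
        while True:
            best = i
            # Move to the higher of the two neighbours, if either is higher.
            for j in (i - 1, i + 1):
                if 0 <= j < n and profile[j] > profile[best]:
                    best = j
            if best == i:  # no higher neighbour: i is a local maximum
                return i
            i = best

    return [climb(i) for i in range(n)]

if __name__ == "__main__":
    toy_profile = [0, 2, 1, 3, 0.5, 1.5, 0]  # hypothetical intensities
    print(watershed_1d(toy_profile))
```

Emission lying between two peaks is simply handed to one of them, which is the behaviour the text describes as appending inter-clump emission to the nearest clump.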

Four years before the advent of CLUMPFIND, ‘structure trees’
were proposed as a way to characterize clouds’ hierarchical structure 
using 2D maps of column density. With this early 2D work as inspiration,
we have developed a structure-identification algorithm that
abstracts the hierarchical structure of a 3D (p-p-v) data cube into
an easily visualized representation called a ‘dendrogram’. Although
tree methodologies are well developed in other data-intensive fields, their
application in astrophysics has so far been rare,
and almost exclusively within the area of galaxy evolution, where
‘merger trees’ are being used with increasing frequency.

Figure 3 and its legend explain the construction of dendrograms 
schematically. The dendrogram quantifies how and where local max- 
ima of emission merge with each other, and its implementation is 
explained in Supplementary Methods. Critically, the dendrogram is 
determined almost entirely by the data itself, and it has negligible 
sensitivity to algorithm parameters. To make graphical presentation 
possible on paper and 2D screens, we ‘flatten’ the dendrograms of 3D 
data (see Fig. 3 and its legend), by sorting their ‘branches’ to not 
cross, which eliminates dimensional information on the x axis while 
preserving all information about connectivity and hierarchy. 
Numbered ‘billiard ball’ labels in the figures let the reader match 
features between a 2D map (Fig. 1), an interactive 3D map (Fig. 2a 
online) and a sorted dendrogram (Fig. 2c). 
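The construction can be sketched for the one-dimensional case illustrated schematically in Fig. 3. The code below is a minimal illustration under stated assumptions, not the implementation described in Supplementary Methods: it visits samples from the brightest downward (equivalent to lowering a test level in tiny steps), starts a new leaf at each local maximum, and records a merger whenever a sample bridges two previously separate structures. The function name, the union-find bookkeeping and the toy profile are all hypothetical.

```python
def dendrogram_1d(profile):
    """Build a simple dendrogram of a 1D emission profile.

    Returns (leaves, merges): `leaves` lists the indices of local maxima in
    order of decreasing intensity; `merges` lists tuples
    (level, peak_a, peak_b) giving the intensity at which the structures
    topped by peak_a and peak_b join.
    """
    parent = {}   # union-find forest over samples already above the level
    peak_of = {}  # root index -> index of the brightest sample it contains

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    leaves, merges = [], []
    order = sorted(range(len(profile)), key=lambda i: profile[i], reverse=True)
    for i in order:
        # Structures already present on either side of this sample.
        neighbours = sorted({find(j) for j in (i - 1, i + 1) if j in parent})
        parent[i] = i
        if not neighbours:
            leaves.append(i)       # isolated sample: a new local maximum
            peak_of[i] = i
            continue
        root = neighbours[0]
        parent[i] = root           # extend an existing structure
        if len(neighbours) == 2:   # this sample bridges two structures
            other = neighbours[1]
            merges.append((profile[i], peak_of[root], peak_of[other]))
            if profile[peak_of[other]] > profile[peak_of[root]]:
                peak_of[root] = peak_of[other]
            parent[other] = root
    return leaves, merges

if __name__ == "__main__":
    toy_profile = [0, 2, 1, 3, 0.5, 1.5, 0]  # hypothetical intensities
    print(dendrogram_1d(toy_profile))
```

For this toy profile the sketch finds three leaves, with the two brightest merging at level 1 and the third joining them at level 0.5. In three dimensions the same logic applies with isosurfaces in place of points and with the full set of neighbouring voxels in place of the two neighbouring samples.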

A dendrogram of a spectral-line data cube allows for the estimation
of key physical properties associated with volumes bounded by isosurfaces,
such as radius (R), velocity dispersion (σ_v) and luminosity
(L). The volumes can have any shape, and in other work we focus on
the significance of the especially elongated features seen in L1448
(Fig. 2a). The luminosity is an approximate proxy for mass, such
that M_lum = X_13CO L_13CO, where X_13CO = 8.0 × 10²⁰ cm² K⁻¹ km⁻¹ s
(ref. 15; see Supplementary Methods and Supplementary Fig. 2).
The derived values for size, mass and velocity dispersion can then be
used to estimate the role of self-gravity at each point in the hierarchy,
via calculation of an ‘observed’ virial parameter, α_obs = 5σ_v²R/(GM_lum).
In principle, extended portions of the tree (Fig. 2, yellow highlighting)
where α_obs < 2 (where gravitational energy is comparable to or larger
than kinetic energy) correspond to regions of p-p-v space where self-gravity
is significant. As α_obs only represents the ratio of kinetic energy
to gravitational energy at one point in time, and does not explicitly
capture external over-pressure and/or magnetic fields, its measured
value should only be used as a guide to the longevity (boundedness) of
any particular feature.
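As a worked illustration of the virial parameter defined above, the following sketch evaluates α_obs = 5σ_v²R/(GM_lum) in units convenient for molecular clouds and applies the α_obs < 2 criterion. The function names and the input values are hypothetical and are not measurements from L1448; the gravitational constant is expressed as G ≈ 4.30091 × 10⁻³ pc (km s⁻¹)² per solar mass.

```python
# Gravitational constant in pc (km/s)^2 per solar mass.
G_ASTRO = 4.30091e-3

def virial_parameter(sigma_v_kms: float, radius_pc: float, mass_msun: float) -> float:
    """Observed virial parameter, alpha_obs = 5 sigma_v^2 R / (G M)."""
    return 5.0 * sigma_v_kms ** 2 * radius_pc / (G_ASTRO * mass_msun)

def is_self_gravitating(alpha_obs: float, threshold: float = 2.0) -> bool:
    """Apply the alpha_obs < 2 criterion used for the yellow highlighting."""
    return alpha_obs < threshold

if __name__ == "__main__":
    # Hypothetical structure: 0.5 km/s dispersion, 0.5 pc radius, 100 solar masses.
    alpha = virial_parameter(0.5, 0.5, 100.0)
    print(round(alpha, 3), is_self_gravitating(alpha))
```

Because α_obs scales inversely with mass, halving the mass of this hypothetical structure would double α_obs and move it above the threshold, which is why the mass calibration matters for the densest leaves.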
Figure 3 | Schematic illustration of the dendrogram process. Shown is the
construction of a dendrogram from a hypothetical one-dimensional 
emission profile (black). The dendrogram (blue) can be constructed by 
‘dropping’ a test constant emission level (purple) from above in tiny steps 
(exaggerated in size here, light lines) until all the local maxima and mergers 
are found, and connected as shown. The intersection of a test level with the 
emission is a set of points (for example the light purple dots) in one 
dimension, a planar curve in two dimensions, and an isosurface in three 
dimensions. The dendrogram of 3D data shown in Fig. 2c is the direct 
analogue of the tree shown here, only constructed from ‘isosurface’ rather 
than ‘point’ intersections. It has been sorted and flattened for representation 
on a flat page, as fully representing dendrograms for 3D data cubes would 
require four dimensions. 
In calculating α_obs, we are implicitly assuming that there is a one-
to-one relationship (known as a ‘bijection’) between a volume in
p-p-v space and a volume of physical (position-position-position,
p-p-p) space. This bijection paradigm is fine for regions that are
dominated by a single structure, but the complexities of relating p-p-v
space to physical space in regions with multiple features along a line
of sight mean that this treatment can only ever give an approximate
measure of the true dynamical state of the cloud. Alternatives
to bijection are considered in the Supplementary Information. The 
bijection assumption comes into play when measuring physical 
properties of individual features, but it does not influence the char- 
acterization of hierarchical structure. 

In Fig. 2c, we show the dendrogram for the same L1448 ¹³CO
spectral-line map shown using contours in Fig. 1. All of the portions
shaded yellow have α_obs < 2, meaning that they are (most) likely to be
self-gravitating. The four most compact p-p-v structures (leaves)
where α_obs < 2 are numbered in Figs 1 and 2, and they are not as
apparent in the projected (2D) view (Fig. 1) as they are in p-p-v (3D)
space (Fig. 2a). In the CLUMPFIND decomposition of the cloud 
(Fig. 2b), these features are not apparent as special. 

Overall, the pattern of yellow highlighting in Fig. 2 suggests the 
importance of gravity on all possible scales, but not within the full 
possible volume, in a cloud like L1448. With the exception of the gas 
around region 4, which appears not to be bound to the rest of L1448, 
the tree shows a fully yellow-highlighted ‘trunk’ and only sporadic 
highlighting on the dendrogram’s tallest branches and leaves. So 
for the material traced by ¹³CO observations, it appears that self-
gravitating structures are more prevalent on larger scales than on
smaller. At densities surpassing 5 × 10³ cm⁻³, ¹³CO becomes an
increasingly poor tracer of mass, so it can only give upper limits
for the ‘true’ virial parameters of the densest, most compact, structures 
seen in the dendrogram. Thus, the highest-density non-yellow leaves 
in Fig. 2c may harbour bound structures only visible with thinner or 
less-depleted molecular lines. On the other hand, lower-density non- 
yellow leaves in Fig. 2c probably represent actual low-mass unbound 
structures in the gas, similar to the ‘pressure-confined’ low-mass 
clumps found in clump-based segmentations. Importantly, the full 
pattern of highlighting explicitly indicates that core-like leaves often 
reside within structures where the mutual gravity between the cores 
(leaves) and/or their environs (branches) is significant enough to 
cause meaningful interactions between cores—possibly even, in the 
most extreme cases, competitive accretion. Recent work has shown
that the overall (column) density distribution of material traced by
CO in a 10-pc-scale molecular cloud is roughly log-normal, and our
result here implies that some of the high-density fluctuations in that 
statistical distribution are bound within themselves and/or to each 
other, and some not. 

Tree hierarchies can be used to intercompare the topology and 
physical properties (for example boundedness) of structures within 
star-forming regions, and such intercomparison can be profitably 
extended to simulations as well. In Fig. 4, we summarize such a 
comparison (see Supplementary Information) with a plot showing 
the fraction of ‘self-gravitating’ (α_obs < 2) material as a function of
spatial scale for both our L1448 data and for a synthetic data cube.
The simulation used to produce the synthetic data is purely hydro- 
dynamic, meaning that the effects of magnetic fields, heating and 
cooling, and self-gravity are not included. The power-law exponent 
characterizing the power spectrum of turbulence in these synthetic 
¹³CO data and in the COMPLETE Perseus data (from which our
L1448 example is drawn) is ~1.8, to within small uncertainties 
(~0.2; ref. 4). However, inspection of Fig. 4 (and of Supple- 
mentary Fig. 4) clearly shows that the data and simulation appear 
quite different in the context of dendrogram analysis: in the simu- 
lation, nearly all material (much more than in the observations) is 
self-gravitating, on all spatial scales. Critically, the analysis of the 
synthetic ¹³CO cube (Supplementary Fig. 4) is done on a simulated
observation of it where we have deliberately matched resolution, 
Figure 4 | The fraction of self-gravitating emission as a function of scale in
L1448 and a comparable simulation. Most of the emission in the L1448
region is contained within large-scale self-gravitating structures, but only a low
fraction of small-scale objects show signs of self-gravitation. (See text for
discussion of the high-density, small-scale, self-gravitating structures to
which ¹³CO is insensitive.) In the L1448 observations, gravity is significant
on all scales, but not in all regions. In contrast, the simulated map implies 
that nearly all scales, and all regions, should be influenced by gravity. 


noise properties and region extent to the L1448 cube (Supplementary
Methods). The (constant) abundance of ¹³CO used for the
synthetic map (Supplementary Information) is set to match the 
known column densities in the simulation, and because abundance 
is simply a multiplicative constant, changing it cannot reproduce the 
scale dependence of gravity found in the L1448 data. 

Thus it appears that the synthetic data cube created from the 
simulation* contains much material that would be significantly affec- 
ted by gravity, if gravity were actually included in the simulation. 

The accuracy with which dendrograms can offer estimates of $\alpha_{\mathrm{obs}}$ is
at or below the 25% level (Supplementary Information). The uncer- 
tainty results primarily from the need to glean a 3D geometry and 
density based on 2D size and column density (mass/area), and any 
analysis of p–p–v data will be subject to the same limitations. More
analysis, using simulations, of the translation from p–p–v to p–p–p
space” should be, and is being, carried out to quantify these uncertainties
more finely. Comparative measurements (for example Fig. 4)
are far more certain as these biases should affect all data sets similarly. 
Thus, the apparent disagreement between observations and simu- 
lation in Fig. 4 can be explained by claiming that either, or both, of 
the following are true: (1) the assumptions/calculations leading to the 
creation of the synthetic ¹³CO observations are faulty; or (2) there is
missing physics in the simulation (for example gravity, thermal 
effects), making it an insufficient approximation to real star-forming 
regions. 
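
As a rough illustration of how a ‘self-gravitating’ label might be assigned from a structure's size, mass and velocity dispersion, the sketch below assumes the commonly used virial parameter $\alpha = 5\sigma_v^2 R/(GM)$ and the threshold of 2 quoted in the text. The exact estimator used for the dendrogram analysis is described in the Supplementary Information and may differ; the two example structures and their values are hypothetical.

```python
# Gravitational constant in units of pc * Msun^-1 * (km/s)^2.
G_PC_MSUN_KMS2 = 4.30091e-3


def virial_parameter(sigma_kms, radius_pc, mass_msun):
    """Return alpha = 5 sigma^2 R / (G M).

    Assumption: this standard definition stands in for the paper's
    observational estimator, which is not spelled out in the main text.
    """
    return 5.0 * sigma_kms**2 * radius_pc / (G_PC_MSUN_KMS2 * mass_msun)


def is_self_gravitating(alpha, threshold=2.0):
    """Apply the alpha < 2 criterion used in the text."""
    return alpha < threshold


# Hypothetical structures: (velocity dispersion km/s, radius pc, mass Msun).
structures = {
    "example leaf": (0.2, 0.05, 2.0),
    "example branch": (0.5, 0.1, 10.0),
}

for name, (sigma, radius, mass) in structures.items():
    alpha = virial_parameter(sigma, radius, mass)
    print(f"{name}: alpha = {alpha:.2f}, self-gravitating = {is_self_gravitating(alpha)}")
```

Because the radius and mass must be inferred from projected size and column density, any such estimate inherits the geometric uncertainty discussed above; ratios between data sets treated identically are less affected.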

Finally, we turn to the relationship between the apparently ‘self- 
gravitating’ regions in L1448 and the star-formation process itself. 
Compact millimetre-wavelength emission peaks caused by dust 
emission (marked by yellow circles in Fig. 1) are typically taken as 
markers of cores that are forming, or are able to form, stars. Within 
the region of L1448 considered here, more than 90% of the compact 
millimetre-dust peaks traced in bolometer observations’ are found 
projected on the sky within one of the dendrogram’s ‘self-gravitating’
leaves, and none is found outside a self-gravitating branch. Recent 
NH₃ observations’” suggest that all, or all but one, of these ‘pre-stellar
cores’ lie within self-gravitating structures along the velocity dimen- 
sion as well'*. As young sources get a little older, they can be detected 
in the mid-infrared (IRAC) bands of the Spitzer Space Telescope. 
Four out of the five sources identified by such IRAC imaging as 
protostar candidates” also lie within a leaf, and each of those four 
is associated with a millimetre-dust peak, suggesting they are embed- 
ded in dense natal cocoons. Interestingly, the one IRAC protostar 
candidate in the region not associated with a self-gravitating leaf is
also not associated with a millimetre-dust peak, suggesting it is a 
more evolved source. All told, these associations suggest that a self- 
gravitating home is critical to the earliest phases of star formation. 

Squeezing and over-squeezing of triphotons


L. K. Shalm, R. B. A. Adamson & A. M. Steinberg


Quantum mechanics places a fundamental limit on the accuracy of 
measurements. In most circumstances, the measurement uncer- 
tainty is distributed equally between pairs of complementary pro- 
perties; this leads to the ‘standard quantum limit’ for measurement 
resolution. Using a technique known as ‘squeezing’, it is possible to 
reduce the uncertainty of one desired property below the standard 
quantum limit at the expense of increasing that of the complement- 
ary one. Squeezing is already being used to enhance the sensitivity of 
gravity-wave detectors’ and may play a critical role in other high 
precision applications, such as atomic clocks” and optical commu- 
nications*. Spin squeezing (the squeezing of angular momentum 
variables) is a powerful tool, particularly in the context of quantum 
light—matter interfaces*°. Although impressive gains in squeezing 
have been made, optical spin-squeezed systems are still many orders 
of magnitude away from the maximum possible squeezing, known 
as the Heisenberg uncertainty limit. Here we demonstrate how an 
optical system can be squeezed essentially all the way to this fun- 
damental bound. We construct spin-squeezed states by overlapping 
three indistinguishable photons in an optical fibre and manipu- 
lating their polarization (spin), resulting in the formation of a 
squeezed composite particle known as a ‘triphoton’. The symmetry 
properties of polarization imply that the measured triphoton states 
can be most naturally represented by quasi-probability distribu- 
tions on the surface of a sphere. In this work we show that the 
spherical topology of polarization imposes a limit on how much 
squeezing can occur, leading to the quasi-probability distributions 
wrapping around the sphere—a phenomenon we term ‘over- 
squeezing’. Our observations of spin-squeezing in the few-photon 
regime could lead to new quantum resources for enhanced mea- 
surement, lithography and information processing that can be pre- 
cisely engineered photon-by-photon. 

The standard way to characterize and represent the polarization of 
classical light is through the use of a Stokes vector. The vector’s 
components are the Stokes parameters $S_1$, $S_2$ and $S_3$ that describe
the degree of linear, diagonal and circular polarization respectively,
while the intensity of the beam is represented by the Stokes parameter
$S_0$. $S_1$, $S_2$ and $S_3$ form a Cartesian coordinate system, and for polarized
light $S_1^2 + S_2^2 + S_3^2 = S_0^2$. Such a Stokes vector terminates at a point
on the surface of what is known as the Poincaré sphere. For quantum 
polarization states, a Poincaré sphere can be constructed where the 
Stokes parameters are replaced by the Stokes operators: 


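\[
\begin{aligned}
\hat{S}_0 &= \hat{a}_H^\dagger \hat{a}_H + \hat{a}_V^\dagger \hat{a}_V = \hat{n}_H + \hat{n}_V = \hat{N}, \\
\hat{S}_1 &= \hat{a}_H^\dagger \hat{a}_H - \hat{a}_V^\dagger \hat{a}_V = \hat{n}_H - \hat{n}_V, \\
\hat{S}_2 &= \hat{a}_H^\dagger \hat{a}_V + \hat{a}_V^\dagger \hat{a}_H = \hat{n}_D - \hat{n}_A, \\
\hat{S}_3 &= i(\hat{a}_V^\dagger \hat{a}_H - \hat{a}_H^\dagger \hat{a}_V) = \hat{n}_R - \hat{n}_L,
\end{aligned}
\tag{1}
\]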


where $\hat{N}$ is the total photon number and $\hat{a}_i^\dagger$, $\hat{a}_i$ and $\hat{n}_i$ are the creation,
annihilation and photon number operators for polarization mode $i$,
and where $i$ takes the values H, V, D, A, R and L corresponding to
horizontal, vertical, diagonal, anti-diagonal, right-hand circular and
left-hand circular polarizations, respectively”. The Stokes operators
do not commute with one another, leading to the Heisenberg uncertainty
relations $V_l V_m \geq |\langle \hat{S}_n \rangle|^2 \epsilon_{l,m,n}$, where $V_j$ represents the variance
in the Stokes operator $\hat{S}_j$. This uncertainty implies that rather than
assuming definite values, the Stokes parameters must be described by 
a joint quasi-probability distribution. A Wigner distribution on the 
Poincaré sphere can be introduced following the approach of ref. 10 
to provide an informationally complete description of the polariza- 
tion state space, and to calculate any of the desired properties of the 
state''"'*. It is important to note that for our triphoton states N is 
fixed at three and hence $S_0$ is a constant without fluctuations.

The state of a single polarized photon can be completely described 
by two parameters (relative amplitude and phase of the two ortho- 
gonal polarization modes). Because there are only two parameters, 
once the centre of the distribution (a point on the surface of the 
Poincaré sphere) is specified, no more freedom remains. All pure 
states are described by the same circularly symmetric quasi-probabil- 
ity distribution centred at a point. This is akin to the Bloch sphere for 
a spin-1/2 particle, where a pure state is completely identified by a 
Bloch vector. In general, N photons polarized in the same direction 
form a spin-coherent state, which is the analogue of the Glauber 
coherent state of a harmonic oscillator’. The quasi-probability dis- 
tribution of the spin-coherent state has circular symmetry and can be 
represented by a single Stokes vector that terminates at the centre of 
the distribution, as in the single-particle case. The uncertainty in the 
Stokes operators of this two-level system scales as $\sqrt{N}$, set by binomial
statistics due to the independence of measurements on individually
polarized (unentangled) photons.
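
The commutation structure and the $\sqrt{N}$ shot-noise scaling can be checked numerically. The following is a minimal sketch, assuming the Stokes operators of equation (1) restricted to the $(N+1)$-dimensional space spanned by $|m, N-m\rangle_{H,V}$; the helper names `stokes_operators` and `variance`, and the sign convention chosen for $\hat{S}_3$, are illustrative assumptions rather than anything taken from the experiment's analysis code.

```python
import numpy as np


def stokes_operators(n_photons):
    """Stokes operators on the fixed-N space; basis index m = number of H photons."""
    m = np.arange(n_photons + 1)
    # Matrix of a_H^dag a_V: <m+1| a_H^dag a_V |m> = sqrt((m+1)(N-m)).
    hop = np.diag(np.sqrt((m[:-1] + 1.0) * (n_photons - m[:-1])), k=-1).astype(complex)
    s0 = n_photons * np.eye(n_photons + 1, dtype=complex)
    s1 = np.diag(2.0 * m - n_photons).astype(complex)   # n_H - n_V
    s2 = hop + hop.conj().T                             # a_H^dag a_V + a_V^dag a_H
    s3 = 1j * (hop.conj().T - hop)                      # i(a_V^dag a_H - a_H^dag a_V)
    return s0, s1, s2, s3


def variance(op, state):
    """Variance of a Hermitian operator in a normalized pure state."""
    mean = np.vdot(state, op @ state).real
    mean_sq = np.vdot(state, op @ op @ state).real
    return mean_sq - mean**2


n = 3
s0, s1, s2, s3 = stokes_operators(n)

# The operators do not commute: [S1, S2] = 2i S3.
commutator = s1 @ s2 - s2 @ s1
print("[S1, S2] == 2i S3:", np.allclose(commutator, 2j * s3))

# Spin-coherent state of three horizontally polarized photons, |3,0>.
coherent_h = np.zeros(n + 1, dtype=complex)
coherent_h[n] = 1.0
print("Var(S2), Var(S3):", variance(s2, coherent_h), variance(s3, coherent_h))
```

For the coherent state both transverse variances equal $N$, so the corresponding uncertainties are $\sqrt{N}$, which is the shot-noise limit referred to in the text.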

For a general quantum state formed by a collection of N photons in 
the same mode, 2N degrees of freedom are required to characterize the 
state, and a simple vector is no longer sufficient to describe the polar- 
ization on the Poincaré sphere. In such a state, the photons do not 
have individual identities and it becomes impossible to speak about 
the polarization of a single photon. Instead, the entire system must be 
treated as a composite particle of spin N/2 and more complex distri- 
butions that break the circular symmetry of the simple two-level sys- 
tem are allowed. In such distributions, polarization squeezing 
becomes evident as the uncertainty in one Stokes operator is decreased 
below the shot-noise limit of $\sqrt{N}$ at the expense of an increase in
uncertainty of the conjugate operator. Previous work on optical 
spin-squeezing has used such distributions, but always in a limit where 
the sphere could be approximated by a tangent plane>’. In these 
experiments, the uncertainty distributions are squeezed by ~100 µrad
on the surface of the sphere. In contrast, the squeezing we measure in 
our experiment is profoundly affected by the topology of the Poincaré 
sphere. When the degree of squeezing becomes large enough, over- 
squeezing occurs. When a triphoton is over-squeezed, the topology of 
the sphere causes the quasi-probability distribution to wrap around 
itself, and further squeezing leads to a counter-intuitive increase in the
uncertainty of the Stokes operator being squeezed. 

The triphoton states we study are of the form
$|\psi\rangle = c_3|3,0\rangle_{H,V} + |2,1\rangle_{H,V} + |1,2\rangle_{H,V} + c_3|0,3\rangle_{H,V}$, where $|m,n\rangle_{H,V}$
represents m and n photons in the orthogonal horizontal (H) and
vertical (V) polarization modes respectively. To experimentally produce
these triphoton states, we place a pair of photons generated by
type-II spontaneous parametric downconversion (SPDC) and a 
single photon from an attenuated local oscillator (LO) into the same 
spatial, temporal and spectral mode following the method of refs 16- 
18, as shown in Fig. 1. Because the photons do not come from true 
single-photon sources, unavoidable background terms occur, as dis- 
cussed further in the Supplementary Information. To minimize 
background events, the pair generation rate and the local oscillator 
rate are carefully balanced to ensure that the majority of three- 
photon events detected originate from a single SPDC pair and a single 
LO photon. Full quantum state tomography is carried out on the 
triphoton states using the hidden differences technique’”’, and 
the resulting density matrix used to calculate the Wigner quasi- 
probability distributions on the Poincaré sphere (as discussed further 
in the Supplementary Information). A total of 11 different triphoton 
states were prepared and measured as shown in Fig. 1 by varying the 
squeezing factor T from 0 to 1.7 (see Methods for further details). 
Here T is the ratio of the transmissivities of the horizontal ($T_H$)
and the vertical ($T_V$) modes of the variable partial polarizer. The
Wigner quasi-probability distributions for five of these states are 
shown in Fig. 2a. 

When T= 0, the partial polarizer only transmits the horizontal 
polarization, producing a quasi-classical coherent state composed of 
three horizontally polarized photons. A quarter-wave plate then 
rotates the coherent state into the circular basis. On the Poincaré 
sphere, this coherent state has a circularly symmetric quasi-probability 
distribution centred on the $S_3$ axis (as shown in Fig. 2b), and an
uncertainty in $S_1$ and $S_2$ given by the shot-noise limit.

Setting T = 0.7 produces the phase state
$|\psi\rangle = \frac{1}{2}(|3,0\rangle_{H,V} + |2,1\rangle_{H,V} + |1,2\rangle_{H,V} + |0,3\rangle_{H,V})$, which represents an equally weighted
superposition of polarization states. On the Poincaré sphere, the 
Wigner distribution is centred on the $S_3$ axis (as shown in Fig. 2c),
but now takes a ‘banana’ shape as the uncertainty along $S_2$ decreases
below the shot-noise limit while the uncertainty in $S_1$ increases above
it. The phase state has been called a ‘jack-of-all-trades’ state’ for 
metrology, as its symmetry and squeezing properties allow it to 
accurately acquire and then track an unknown phase. 

Finally, tuning the partial polarizer to $T = 1.7 \approx \sqrt{3}$ creates the
state $|\psi\rangle = \frac{1}{\sqrt{2}}(|N,0\rangle_{H,V} + |0,N\rangle_{H,V})$, where for us $N = 3$. Such states
are called ‘NOON’ states and are capable of reaching the Heisenberg
limit in phase-sensitive measurements and also of achieving super- 
resolution in lithographic applications’®**°. Our NOON state has a 
fidelity of 0.68 ± 0.01 with an ideal NOON state. This low fidelity
stems from background events in our state preparation and the fact 
that the three photons are not completely indistinguishable. The 
major background contributions are due to either two SPDC pairs 
or three LO photons triggering a threefold coincidence. To account 
for the background contributions, state tomography was indepen- 
dently carried out on each major background source (by alternately 
blocking the LO and SPDC photons) and then subtracted off the 
measured density matrix, yielding a fidelity with an ideal NOON state 
of 0.80 ± 0.01. This background-subtracted NOON state still contains
smaller background contributions that make up ~ 10% of the mea- 
sured state (as further discussed in the Supplementary Information). 
Because the three photons are not generated by the same process, 
there is also unavoidable distinguishing information present in the 
state. This distinguishing information requires more parameters, and 
therefore a larger density matrix, to fully specify the polarization of 
the state. The extra elements in this density matrix represent the 
entanglement of the triphoton state with the environment, and pro- 
vide a lower bound on the amount of distinguishing information 
present’. With the development of ideal single-photon sources, both 
the unwanted background contributions and the residual distin- 
guishing information in our states will be eliminated and our pro- 
tocol can be scaled beyond three photons. 
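
To make the fidelity figures concrete, the sketch below computes the overlap $F = \langle\psi|\rho|\psi\rangle$ between a density matrix and the ideal three-photon NOON state in the four-dimensional space spanned by $|m, 3-m\rangle_{H,V}$. The definition of fidelity and the white-noise admixture are assumptions made only for illustration: the backgrounds in the experiment are polarized rather than white, and the mixing weight used here is hypothetical, not a measured value.

```python
import numpy as np


def fidelity_with_pure_state(rho, target):
    """Fidelity <target| rho |target> of a density matrix with a pure target state."""
    return float(np.real(np.vdot(target, rho @ target)))


dim = 4  # basis |m, 3-m>_{H,V} for m = 0..3

# Ideal NOON state (|3,0> + |0,3>)/sqrt(2).
noon = np.zeros(dim, dtype=complex)
noon[0] = noon[3] = 1.0 / np.sqrt(2.0)
rho_ideal = np.outer(noon, noon.conj())


def mix_with_white_noise(rho, weight):
    """Hypothetical noise model: weight * rho + (1 - weight) * identity / dim."""
    return weight * rho + (1.0 - weight) * np.eye(dim) / dim


for weight in (1.0, 0.6):
    rho = mix_with_white_noise(rho_ideal, weight)
    print(f"weight = {weight}: fidelity = {fidelity_with_pure_state(rho, noon):.3f}")
```

In this toy model the fidelity is $p + (1-p)/4$ for signal weight $p$, which shows how an incoherent background lowers the overlap with the ideal state even when the underlying signal is perfect.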

In the metrology and lithography applications for which NOON 
states have been proposed, the detection schemes rely on N-photon 
absorbers that are sensitive only to symmetric states. Because the extra 
elements in the density matrix resulting from distinguishing informa- 
tion represent states of mixed symmetry, they will be automatically 
filtered out by the detectors in any practical implementation. After 
performing such filtering and accounting for the major background 
contributions (as further discussed in the Supplementary 
Information), our NOON state has a fidelity of 0.89 ± 0.01 with an
ideal state. As shown in Fig. 2d, the Wigner quasi-probability distribution
is peaked at the poles of the $S_1$ axis, corresponding to finding all
three photons being either horizontally or vertically polarized, and 
displays a threefold symmetry around the equator. This threefold 
symmetry is directly related to the threefold enhancement in phase 
super-resolution of this NOON state'®, and the minimum feature 
width corresponds to the Heisenberg-limited phase super-sensitivity 
possible in interferometry using NOON states**”*. 

To quantify the degree of entanglement in our states, we calculate
the negativity $N(\rho) = 2\sum_j \max(0, -\mu_j)$, a measure related to the
Peres–Horodecki criterion’””’, where the $\mu_j$ are the eigenvalues of the
partial transpose $\rho^{T}$ of the density matrix $\rho$. Any positive value of
the negativity indicates entanglement, with maximally entangled
states such as the NOON state having $N(\rho) = 1$. Using the negativity,
we can determine the bipartite entanglement between any single 

Figure 1 | Overview of the experiment. A pair of downconverted photons
from type-II spontaneous parametric downconversion (SPDC) are placed
into the same mode with a polarizing beamsplitter (PBS). A half-wave plate
(HWP) rotates the photons into the diagonal basis, and a third photon from
a local oscillator (LO) is combined into the same mode using a laser window.
A variable partial polarizer (VPP) is used to manipulate the polarization of
the photons. The resulting triphotons are coupled into a single mode fibre
(SMF) that is polarization maintaining (PM) and spectrally filtered using an
interference filter (IF). Quantum state tomography is carried out by using a
polarization analyser consisting of two quarter-wave plates (QWP), one
HWP and a PBS, and threefold coincidence counts are collected using
coincidence circuitry and single-photon counting modules (SPCMs).

Figure 2 | Triphoton Wigner quasi-probability distributions on the
Poincaré sphere. a, Wigner quasi-probability distributions for five
triphoton states with different squeezing parameters T. b–d, Two-dimensional
projections (top) and an equal-area cylindrical map projection
(bottom) of the quasi-probability distributions for a coherent state (b), phase
state (c) and a background-subtracted NOON state (d). In b, the coherent


photon and the rest of the system. Starting with the coherent state at 
T = 0 with a negativity of 0.05 ± 0.03, the degree of entanglement
steadily increases until the background-corrected NOON state is
reached, which has a negativity of 0.52 ± 0.03.
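
The negativity between one particle and the rest can be sketched numerically. The example below assumes the normalization $N(\rho) = 2\sum_j \max(0, -\mu_j)$ and treats the three photons as three polarization qubits, so that the ideal NOON state corresponds to $(|HHH\rangle + |VVV\rangle)/\sqrt{2}$; the function names are hypothetical, and the experiment's own analysis works with the measured density matrix rather than these ideal states.

```python
import numpy as np


def partial_transpose_first(rho, dim_a, dim_b):
    """Transpose subsystem A of a density matrix on a (dim_a x dim_b) bipartition."""
    reshaped = rho.reshape(dim_a, dim_b, dim_a, dim_b)
    # Swap the row and column indices that belong to subsystem A.
    return reshaped.transpose(2, 1, 0, 3).reshape(dim_a * dim_b, dim_a * dim_b)


def negativity(rho, dim_a, dim_b):
    """Twice the sum of the magnitudes of the negative partial-transpose eigenvalues."""
    eigenvalues = np.linalg.eigvalsh(partial_transpose_first(rho, dim_a, dim_b))
    return 2.0 * float(np.sum(np.maximum(0.0, -eigenvalues)))


# Ideal three-qubit state (|000> + |111>)/sqrt(2), with 0 = H and 1 = V.
entangled = np.zeros(8, dtype=complex)
entangled[0] = entangled[7] = 1.0 / np.sqrt(2.0)
rho_entangled = np.outer(entangled, entangled.conj())

# Unentangled state |000>: three horizontally polarized photons.
product = np.zeros(8, dtype=complex)
product[0] = 1.0
rho_product = np.outer(product, product.conj())

# Bipartition: one photon (dimension 2) versus the other two (dimension 4).
print("entangled:", round(negativity(rho_entangled, 2, 4), 6))
print("product:  ", round(negativity(rho_product, 2, 4), 6))
```

With this normalization the ideal entangled state reaches the maximum value of 1 and the product state gives 0, which brackets the measured values quoted above.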

The uncertainty in the Stokes operators is measured for each of the 
11 states created, as shown in Fig. 3. Starting with the coherent state at 
T = 0 and continuing to the state with T = 1, the uncertainty in $S_2$
decreases below the shot-noise limit while the degree of squeezing
increases. However, after T = 1, the states begin to over-squeeze, as
further squeezing leads to an increase in the uncertainty of both $S_1$
and $S_2$. As the quasi-probability distributions wrap around the surface
of the Poincaré sphere, the three photons become more entangled
but appear to depolarize with respect to the Stokes operators. The
NOON state is maximally entangled and contains perfect correlations, 
yet appears completely unpolarized to intensity measurements. 
Unlike mixed states, which appear as broad probability distributions 
on the surface of the Poincaré sphere (with the maximally mixed state 
represented by a uniform distribution), the NOON state distribution 
contains the sharpest features possible due to the quantum correla- 
tions that exist. This is analogous to the maximally entangled 
Greenberger—Horne—Zeilinger (GHZ) state where any single mem- 
ber appears mixed, yet perfect correlations exist between all particles. 
For both the NOON and the GHZ states, these correlations only 
become evident when performing multi-particle measurements 
using, for example, number-resolving detectors. 
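
The statement that a NOON state looks unpolarized to intensity measurements can be illustrated directly. The sketch below assumes the ideal three-photon NOON state and the Stokes operators of equation (1), written out for $N = 3$ in the basis $|m, 3-m\rangle_{H,V}$; it is an idealized calculation, not a model of the measured, background-affected states.

```python
import numpy as np

n = 3
m = np.arange(n + 1)  # number of H photons in |m, n-m>_{H,V}

# a_H^dag a_V has matrix elements sqrt((m+1)(n-m)) on the sub-diagonal.
hop = np.diag(np.sqrt((m[:-1] + 1.0) * (n - m[:-1])), k=-1)
stokes = {
    "S1": np.diag(2.0 * m - n).astype(complex),
    "S2": (hop + hop.T).astype(complex),
    "S3": 1j * (hop.T - hop),
}


def expectation(op, state):
    """Expectation value of a Hermitian operator in a normalized pure state."""
    return float(np.vdot(state, op @ state).real)


# Spin-coherent state |3,0> and ideal NOON state (|3,0> + |0,3>)/sqrt(2).
coherent = np.zeros(n + 1, dtype=complex)
coherent[n] = 1.0
noon = np.zeros(n + 1, dtype=complex)
noon[0] = noon[n] = 1.0 / np.sqrt(2.0)

for label, state in (("coherent", coherent), ("NOON", noon)):
    means = [expectation(op, state) for op in stokes.values()]
    print(label, np.round(means, 6))

# Spread of S1 in the NOON state: outcomes are +3 or -3 with equal probability.
noon_var_s1 = expectation(stokes["S1"] @ stokes["S1"], noon) - expectation(stokes["S1"], noon) ** 2
print("NOON Var(S1):", noon_var_s1)
```

All three mean Stokes components vanish for the NOON state, whereas the coherent state has a mean Stokes vector of full length; the NOON-state variance in $S_1$ is $N^2 = 9$, well above the shot-noise value of $N = 3$.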

In our experiment, the presence of background terms leads to a 
reduction in the purity and amount of squeezing possible. As dis- 
cussed in the Supplementary Information, the background terms 
are polarized and add incoherently to the desired triphoton state. It 
is interesting to note that the background-subtracted NOON state has a 
larger uncertainty in $S_2$ than the non-subtracted case. This is because
the background contributions in the state at T= 1.7 reduce the 
amount of squeezing present, but the state is still within the over- 
squeezing regime and therefore a decrease in uncertainty is measured. 
Once the dominant background contributions are experimentally 
measured and removed, the purity increases and the quantum corre- 
lations become stronger, leading to an increase in the uncertainty of 


state has a uniform distribution centred about the $S_3$ axis. As the squeezing
parameter is increased to produce a phase state, the width of the angular 
features decreases until the minimum feature size is reached in d with the 
maximally-squeezed NOON state. In both the phase state and NOON state, 
large regions of quasi-probability distribution take on negative values. The 
colour scale at right applies to all panels. 


$S_2$. When higher order moments are measured, the background-subtracted
NOON state exhibits phase resolution and sensitivity superior
to those of the non-background-subtracted state. Squeezed states

Figure 3 | The uncertainty in the Stokes parameters $S_1$ and $S_2$ for 11
squeezed triphoton states. As the squeezing parameter is increased to
T = 1, the uncertainty in $S_2$ decreases. After T = 1, the topology of the sphere
causes the triphoton states to over-squeeze, and the uncertainty in $S_2$ begins
to increase again as the state becomes depolarized with respect to single-particle
intensity measurements. Insets show the Wigner quasi-probability
distributions for the spin-coherent state T = 0 (top left), the phase state
T = 0.7 (bottom left), and the background-corrected NOON state T = 1.7
(bottom right). For the NOON state, both the side and top views are shown to
illustrate how the over-squeezing leads to an increase in the uncertainty of
$S_2$. Solid lines indicate theory, crosses are experimental data points. The
latter each have an error of less than 2%. The discrepancy between the
measured states and theory is due to background contributions. Filled circles
are the uncertainties in the NOON state after corrections for background
events and distinguishing information have been made.

are often studied in the context of improving phase measurements
with sub-shot-noise sensitivity. However, for highly squeezed states in 
the over-squeezing limit (such as the NOON state), multi-photon 
detectors as opposed to intensity-based measurements are required 
to take advantage of the enhanced precision offered. 

This is the first time that spin-squeezing in the few-photon limit 
has been observed. Recently, there have also been studies of atomic 
spin-squeezed states which implicitly contain over-squeezing*”, but 
our work represents the first time that the phenomenon of over- 
squeezing and its relation to quantum metrology has been directly 
investigated. It unifies the concepts of spin-squeezing of intense 
beams and multi-photon Heisenberg-limited metrology, highlight- 
ing the deep connection between squeezing and entanglement. This is 
indicated in the maximally squeezed NOON state, where no single- 
particle information remains and all the information resides purely 
in non-classical correlations. With improvements in single-photon 
sources, this relationship between squeezing and entanglement could 
be further exploited to engineer custom non-classical states that 
would advance key quantum technologies, including quantum com- 
putation, lithography and cryptography. 


METHODS SUMMARY 


To produce our triphoton states, we start with a pulsed 40 MHz Ti:sapphire laser 
operating at 810 nm, which produces 100-fs pulses that are then frequency-doubled
using a 0.5-mm-thick β-barium borate (BBO) crystal to produce
50 mW of pulsed light at 405 nm. These pulses are focused into a 0.5-mm BBO
crystal, phase-matched for type-II spontaneous parametric downconversion with 
beam-like emission”’ to generate a pair of photons that are combined into the same 
mode using a polarizing beamsplitter. These two photons are then sent through a 
half wave-plate at 22.5° to produce the state $a_{45}^\dagger a_{-45}^\dagger|0\rangle = (a_H^{\dagger 2} - a_V^{\dagger 2})|0\rangle$, where H
and V are the horizontal and vertical polarization modes, respectively. Next, the 
downconverted pair of photons is combined with a horizontally polarized laser 
photon on a highly transmissive beamsplitter to produce the unnormalized state 
$a_{45}^\dagger a_{-45}^\dagger a_H^\dagger|0\rangle = (a_H^{\dagger 2} - a_V^{\dagger 2})a_H^\dagger|0\rangle$. This state is then sent to a variable partial polarizer
where either the horizontal or vertical polarization component can be attenuated
and the polarization state of the three photons controlled. The action of the
variable partial polarizer is to squeeze the polarization distribution of the photons, 
producing the state |C)=ye~% "al. 


ta! 4sa);|0), where 7 is a normalization 
factor and T is the squeezing factor given by the ratio of the transmissivities
of the horizontal ($T_H$) and the vertical ($T_V$) modes of the partial polarizer. Following
the partial polarizer, a quarter-wave plate is used to rotate the state from the 
circular to linear polarization basis. A 12 nm spectral filter centred at 810 nm
and a single-mode fibre are used to remove spectral and spatial distinguishing 
information. 

A polarization analyser, consisting of a quarter-wave plate followed by a fibre 
polarization beamsplitter and four single-photon detectors, is used to carry out 
quantum state tomography as further discussed in the Supplementary 
Information. It is impossible to combine photons with non-orthogonal polar- 
izations into the same mode with certainty; therefore we use post-selection to 
study only the cases where all three photons are detected in the same mode. 

Optical manipulation of nanoparticles and
biomolecules in sub-wavelength slot waveguides 


Allen H. J. Yang, Sean D. Moore, Bradley S. Schmidt, Matthew Klug, Michal Lipson & David Erickson


The ability to manipulate nanoscopic matter precisely is critical for 
the development of active nanosystems. Optical tweezers'* are 
excellent tools for transporting particles ranging in size from several 
micrometres to a few hundred nanometres. Manipulation of dielec- 
tric objects with much smaller diameters, however, requires stron- 
ger optical confinement and higher intensities than can be provided 
by these diffraction-limited* systems. Here we present an approach 
to optofluidic transport that overcomes these limitations, using sub- 
wavelength liquid-core slot waveguides’. The technique simulta- 
neously makes use of near-field optical forces to confine matter 
inside the waveguide and scattering/absorption forces to transport
it. The ability of the slot waveguide to condense the accessible elec- 
tromagnetic energy to scales as small as 60 nm allows us also to 
overcome the fundamental diffraction problem. We apply the 
approach here to the trapping and transport of 75-nm dielectric 
nanoparticles and λ-DNA molecules. Because trapping occurs along
a line, rather than at a point as with traditional point traps”*, the 
method provides the ability to handle extended biomolecules 
directly. We also carry out a detailed numerical analysis that relates 
the near-field optical forces to release kinetics. We believe that the 
architecture demonstrated here will help to bridge the gap between 
optical manipulation and nanofluidics. 

In a series of recent articles, Psaltis et al.? and Monat et al.'° describe
the enormous potential of integrating optical and microfluidic ele- 
ments into ‘lab-on-a-chip’ devices, particularly in improving fluid 
and particle manipulations. Traditionally accomplished through 
direct particle manipulation with laser tweezers'*"'' or indirectly 
using optically induced microfluidic effects'”'’, the precision with 
which particles can be manipulated with these techniques makes them 
particularly useful for applications ranging from flow cytometry” to 
self-assembly'*. Fundamentally, however, these free-space systems are 
limited in two ways. First, diffraction limits how tightly the light can be 
focused and thereby limits the overall strength of the trap. Second, the 
trapping region has a very short focal depth, preventing the continu- 
ous transport of nanoparticles by means of radiation pressure. 

To improve trapping stability, a number of near-field methods 
have recently been developed'**. Examples are the use of interfering 
Gaussian beams reflected off a prism surface to sort 350-nm polysty- 
rene beads’, and the use of localized plasmonic resonances in sur- 
face-bound metallic nanostructures to trap 200-nm_ dielectric 
particles*. Waveguide-based optical transport””’ is analogous to 
these near-field methods in that the evanescent field extending into 
the surrounding liquid serves to attract particles to the waveguide. 
However, particles also experience photon scattering and absorption 
forces which propel them along it for a distance limited only by the 
losses in the system. Recent efforts in this area have demonstrated the 
sustained propulsion of dielectric microparticles*””*, metallic nano- 
particles*?** and cells**. The limitation that prevents these systems 


from manipulating smaller matter, including biomolecules, is that 
the particles interact with only a small portion of the total transported
light because most of it is confined within the solid core of the
waveguide. 

Recently Almeida et al.° developed a nanophotonic structure, 
known as a slot waveguide, that can overcome this challenge. 
Illustrated in Fig. 1, the slot waveguide comprises a nanoscale slot 
sandwiched between two materials of much higher refractive index. 
As shown in Fig. 1b, there exists a pseudo-transverse-electric (TE) 
mode that has a large electric field discontinuity at the horizontal 
boundaries of the slot region. The result is a high-intensity eigenmode 
in the slot making the majority of the optical energy accessible within 
the low-index fluid region. Here we demonstrate the use of these sub- 
wavelength-scale slot waveguides for optical capture, trapping and 
transport of both dielectric nanoparticles and λ-DNA molecules.
We can achieve stable trapping of particles as small as 75 nm, repre- 
senting some of the smallest dielectric matter ever trapped or trans- 
ported using such a system. In addition to the experimental results, we 
also present an examination of the effect that the presence of a particle
has on the optical mode, details of the trap strength and stiffness in 
comparison with other techniques, and a unique stability analysis 
descriptive of the release kinetics for particles near the stability point. 

As shown in Fig. 2a (see also Supplementary Movies 1 and 2), we 
are able to capture and stably trap polystyrene nanoparticles (refractive
index n = 1.45) with diameters of 75 nm and 100 nm in slot
waveguides with widths of 100 nm and 120 nm, respectively. In all
cases the optical power at the exit of the fibre used to couple light into 
the waveguide was less than 300 mW, the excitation wavelength was 
λ = 1,550 nm, and the trapping used TE polarization. Experimental
details are available in the Supplementary Information. The first 
movie (Supplementary Movie 1) illustrates our ability to capture 
and accumulate flowing particles in the slot waveguide for indefinite 
periods of time and release them by either reducing the optical power 
or switching the polarization. Excitation of the slot waveguides using 
transverse-magnetic (TM) polarization required three to five times as 
much power to obtain stable trapping, so switching polarization also 
tended to break the trap. The microfluidic flow serves to transport the 
particles to the waveguide but does not have a role in the trapping 
itself. This is indicated by the fact that the trap breaks on removal of 
the optical excitation in the above experiments. 

The second movie (Supplementary Movie 2) illustrates the 
dynamics of the capture of flowing particles in a trap near the stability 
point, where the random thermal energy in the system is of the same 
order as the amount of work required to break the trap”®. The average 
retention time in such a trap is a statistical process governed by the 
release kinetics of the system, which in turn is governed by the trap 
strength, its stiffness and the location on the waveguide where the 
particle is trapped (as will be described below). In the experiment 

Figure 1 | Nanoscale optofluidic transport. a, Schematic illustrating the 
transport of two different sizes of nanoparticles in a slot waveguide. The 
force F_prop represents the radiation pressure force responsible for optofluidic 
transport, and F_trap is the trapping force that holds nanoparticles within the 
slot region. b, Mode profile for a silicon-on-insulator 40-nm slot waveguide 

shown, the nanoparticles are flowing by in the microchannel at an 
average speed of 80 µm s⁻¹ and, as can be seen, the number of particles 
that are captured is low (less than 25%) in comparison with those 
that flow by. The reason for the low capture rate is that to be trapped, 
a particle must be on a streamline that passes through the evanescent 
field. This is analogous to a situation in which a flowing particle must 
pass through the focal point of a free-space optical tweezer in order to 
be trapped, and is not an inherent limitation of our system. The 
capture rate can be increased by decreasing the channel size, reducing 
the flow rate or increasing the optical power. 

As mentioned above, an advantage of this approach is the ability 
not only to capture nanoscopic matter but to transport it optically. 
This capability is important for the development of active nano- 
assembly techniques and for optically driven bioanalytics””. From 
Rayleigh theory it is well known that the radiation-pressure-based 
transport velocity of a dielectric nanoparticle is proportional to the 
local intensity and scales with the fifth power of particle radius**. As 
such it is extremely difficult to optically transport very small matter 
unless very high optical intensities can be achieved. As shown in 
Fig. 2b (and Supplementary Movie 3), using our slot waveguide 
system we have been able to demonstrate optical propulsion of 
100-nm particles at an average speed of 1.5 µm s⁻¹ (using 250 mW 
optical power measured at the exit of the fibre). Because the propul- 
sion velocity is inversely proportional to the fourth power of wave- 
length, one method by which the transport velocity could be 
increased is by using a different high-refractive-index material that 
is transparent at lower wavelengths. 
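
The scaling quoted above (transport velocity proportional to local intensity and to the fifth power of particle radius, and inversely proportional to the fourth power of wavelength) can be illustrated with a short numerical sketch. The function name and the 1,064-nm comparison wavelength are illustrative assumptions, not values taken from the experiments; only ratios are computed, so no absolute velocity is implied.

```python
def relative_transport_velocity(radius_ratio, wavelength_ratio=1.0, intensity_ratio=1.0):
    """Velocity relative to a reference case, using the scaling quoted in the text:
    v proportional to I * r**5 / wavelength**4 (Rayleigh regime)."""
    return intensity_ratio * radius_ratio**5 / wavelength_ratio**4


# 75-nm versus 100-nm particle at equal intensity and wavelength.
small_vs_large = relative_transport_velocity(radius_ratio=75 / 100)

# Same particle and intensity; hypothetical change from 1,550 nm to 1,064 nm.
shorter_wavelength = relative_transport_velocity(1.0, wavelength_ratio=1064 / 1550)

print(f"75 nm relative to 100 nm: {small_vs_large:.3f}")
print(f"1,064 nm relative to 1,550 nm: {shorter_wavelength:.2f}")
```

Under these assumptions the smaller particle moves at roughly a quarter of the speed of the larger one, which is why high local intensities are needed to transport very small matter.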


[Figure 2 panel labels: 75-nm particles; laser turned off, particles released; nanoparticles passing over the waveguide collect in the slot.] 


Figure 2 | Trapping and transport of dielectric nanoparticles in a slot 
waveguide. Movies illustrating nanoparticle capture, transport and release 
are included in the Supplementary Information. a, Waveguide is optically 
excited while 75-nm polystyrene nanoparticles flow in the overlying 
microchannel over 100-nm slot waveguides. Over time, particles collect in 
the slot and also on the sides of the waveguide. At t = 0, the laser source is 
removed and particles are released from the waveguide. Immediately after 

immersed in water, calculated using a finite-element simulation package. 
The main trapping region is in the high-intensity slot mode, although 
alternate trapping locations are located on the sides of the waveguide, where 
there are two decaying evanescent modes. c, Scanning electron microscope 
image of 100-nm slot waveguide structure. 


Nanoscopic dielectric particles can be considered as coarse approx- 
imate models for biological species such as viruses and very small 
bacteria. Of perhaps greater interest is the ability to capture and opti- 
cally confine individual biomolecules. As shown in Fig. 3 (and 
Supplementary Movie 4) we have been able to capture from solution 
and stably trap individual strands of YOYO-1 tagged λ-DNA molecules 
48 kilobases long. As in previous cases, trapping was accomplished 
using 250 mW of input optical power at a wavelength of 
1,550 nm; in this case, however, we used a 60-nm slot waveguide. As 
can be seen in the second half of the movie, when the power is removed 
the DNA strands are released. These experiments were done under pH 
conditions in which the DNA is reported to be in a partially extended 
state. Although others have demonstrated optical trapping of λ-DNA 
at pH levels at which the molecule is known to be in a supercoiled 
state, it has proved difficult to trap partially extended molecules 
because the focal point of a tightly focused tweezer can only interrogate 
a small portion of the molecule. The slot waveguide technique allows 
us to trap extended molecules, as the confinement force is equivalently 
applied along a line rather than at a point. Further development of the 
transport technique may also allow us to develop new biomolecular 
separation mechanisms or new methods of interrogating single mole- 
cules for rapid sequencing” or direct haplotyping”. 

To characterize the trapping stability, stiffness and release kinetics 
better, we have carried out a detailed three-dimensional numerical 
analysis of the system. Referring back to Fig. 2, although most of the 
particles were observed to be trapped in the high-intensity slot region, 
trapping was also observed along the sides of the slot waveguide 

release, a ‘cloud’ of particles forms as the particles leave their trapping sites 
and the released particles are carried down the channel by the fluid flow. 
b, Trapped 100-nm nanoparticles in 120-nm slot waveguides are transported 
a short distance by radiation pressure. Time-lapse images are cropped from 
images taken using a SensiCam CCD camera with contrast and brightness 
adjustments to the entire image. The cropping location is the same in each 
time-lapse image. 

structure. Because the trap strength is related to the local optical field 
intensity, we can compare the behaviours of particles trapped in the 
two different regions to gauge the effect on trapping stability. To do 
this, we used a finite-element simulation to calculate the relative 

Figure 4 | Trapping forces on particles trapped inside and outside the slot 
waveguide, and analysis of particle release kinetics. The work required to 
release a particle from the trap is found by integrating the force curves from a 
particle’s stable trapping position to infinity. Inset images show the 
calculated electric field distribution with the particle in the two stable 
trapping positions. The arrows indicate the direction of release. a, Trapping 
force for a 100-nm particle in a 120-nm slot plotted against its position 
relative to the height of the slot waveguide. The particle position is measured 
as the distance of the particle centre from the bottom of the 200-nm-tall slot 
waveguide. The trapping force reaches a maximum at the point where the 
field gradient is strongest at the entrance to the slot waveguide. b, Trapping 
force for a 100-nm particle side-trapped in the same structure with two 
possible release paths. Particle position is measured as the deviation of the 

Figure 3 | Capture and trapping of λ-DNA. 
Movies illustrating the DNA capture and release 
are included in the Supplementary Information. 
Images show individual YOYO-tagged 48-kilobase 
λ-DNA flowing over an optically excited 
60-nm-wide slot waveguide. At time t = 0 the 
encircled DNA is trapped. In this case the DNA is 
released at the 2.6-s mark and flows downstream. 
Supplementary Movie 4 shows the collection of 
DNA molecules over time and their release in 
response to removal of the optical excitation. 
Trapping conditions are identical to those used 
for the nanoparticle experiments shown in Fig. 2. 
Time-lapse images are cropped from images 
taken using a SensiCam CCD camera with 
contrast and brightness adjustments to the entire 
image. The cropping location is the same in each 
time-lapse image. 


trapping force for a side-trapped particle in comparison with one 
trapped in the slot, shown in Fig. 4a, b. As can be seen, the trapping 
force for side-trapped particles is much smaller, and as a result less 
work energy is required to exceed the trapping release barrier. This 

particle centre from its most stable position. The blue line corresponds to a 
particle being pushed off the waveguide by an external force, and the green 
line corresponds to a particle being lifted up off the waveguide. The trapping 
forces listed are normalized to 1 W of guided power. c, Experimental data for 
N/N₀ plotted against normalized time, t. Particles trapped in the slot are 
represented by crosses, side-trapped particles are represented by circles, and 
DNA by × signs. The quantity N/N₀ represents the relative ‘concentration’ 
of particles trapped on the waveguide. d, Plot of F(N) (see Supplementary 
Information) for a second-order rate law (1/N − 1/N₀) versus t. The same 
symbols are used here as in c. The lines represent linear fits to the data, from 
which the release rate constant can be obtained. In this case sharper slopes 
represent a higher release rate and therefore a less stable trap. 

provides an avenue to differentiate the two situations, as we would 
expect side-trapped particles to be released more easily than their slot- 
trapped analogues. Figure 4a, b also illustrates that the optical mode 
of the slot waveguide is not greatly perturbed by the presence of the 
particle in either position. 

Because the direction of trap release is in the vertical direction, and 
because of the physical confinement provided by the channel walls 
and the fact that the trapping force does not vary along the length of 
the waveguide, it is difficult to extract quantitative values for the 
trapping stability experimentally. From the numerical computations, 
however, we can estimate the trap stiffness from the slope of the 
force-distance curve as being 0.2 pN per nanometre for a 100-nm 
nanoparticle at 1-W excitation power. Although a direct comparison 
is difficult, this is significantly higher than previously described for 
larger particles using other near-field techniques (see Grigorenko 
et al., who reported a stiffness of 0.013 pN nm⁻¹ for a 200-nm bead). 

As described in previous analytical work”, for a particle in an optical 
near field, there is a finite amount of work energy required to remove a 
particle from a stably trapped location to one where the trap no longer 
has any influence. When trapping is relatively weak and the particles 
small, the random thermal energy in the system will eventually exceed 
this work and the particle will be released. In such cases, the work 
energy is analogous to an activation energy barrier that impedes the 
release of particles from the waveguide, and the kinetic behaviour is 
similar to molecular desorption from a surface (see Supplementary 
Information for the theoretical basis). Understanding how related 
parameters such as slot width, particle composition and particle size 
affect the release rate of trapped nanoparticles yields information 
necessary for the engineering of robust, stable slot transport devices. 

We carried out a large number of trapping experiments for nano- 
particles trapped inside the slot, nanoparticles trapped outside the 
slot, and the trapped DNA. Trapping was done near the stability 
point such that the targets would self-release from the waveguide 
structure. Figure 4c shows a plot of the total number of trapped 
targets on the waveguide as a function of normalized time. The aver- 
age release time for particles trapped inside the slot is larger than that 
for those trapped outside, suggesting greater stability, consistent with 
our earlier numerical predictions. The trapping stability is related to a 
kinetic constant, k, which can be obtained by plotting the above data 
as a function of reduced time for an appropriate rate law (see 
Supplementary Information). In Fig. 4d we see that side-trapped 
particles have a larger rate constant, suggesting that the release 
(desorption) is faster and a lower work energy is required for release. 
It is not yet clear why the release process seems to have a second-order 
rate, but we believe that it results from the exponential decay of the 
electromagnetic trapping force coupled with hydrodynamic drag. 
DNA shows a lower stability than the polystyrene nanoparticles, 
probably because of the extended conformation that it obtains dur- 
ing trapping. As the trapping stability is likely to be strongly depend- 
ent on molecular conformation, analysis of the release kinetics in 
such systems may result in a new method of single-molecule analysis. 
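
As an illustration of how a rate constant might be extracted from release data of this kind, the sketch below assumes the standard integrated second-order form, 1/N − 1/N₀ = kt, and fits a straight line through the origin. The exact definition of F(N) is given in the Supplementary Information, which is not reproduced here, so this form is an assumption; the data are synthetic and the function names are hypothetical.

```python
def second_order_transform(counts):
    """Return F(N) = 1/N - 1/N0 for each count, taking N0 as the first entry."""
    n0 = counts[0]
    return [1.0 / n - 1.0 / n0 for n in counts]


def fit_rate_constant(times, counts):
    """Least-squares slope of F(N) against time for a line through the origin."""
    f_values = second_order_transform(counts)
    numerator = sum(t * f for t, f in zip(times, f_values))
    denominator = sum(t * t for t in times)
    return numerator / denominator


# Synthetic (not experimental) data generated from N(t) = N0 / (1 + N0*k*t).
n0, k_true = 20.0, 0.05
times = [0.0, 1.0, 2.0, 3.0, 4.0]
counts = [n0 / (1.0 + n0 * k_true * t) for t in times]

k_fit = fit_rate_constant(times, counts)
print(f"fitted rate constant: {k_fit:.4f}")
```

A steeper fitted slope corresponds to faster release and therefore to a less stable trap, which is the comparison drawn between slot-trapped and side-trapped particles.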

Sub-wavelength slot waveguides such as those used in this work 
can be integrated into lab-on-a-chip platforms using existing man- 
ufacturing techniques. They allow discrete optical manipulation and 
transport of nanoscopic objects with greater precision than is avail- 
able with existing approaches. The fusion of nanofluidics and optical 
manipulation in this manner could lead to new methods of bioana- 
lysis and directed assembly. 


METHODS SUMMARY 


We fabricate the slot waveguide chips using electron beam lithography. After 
experimental use, the chips are cleaned using Nanostrip, and scum is removed by 
using a reactive ion oxygen plasma etching process. The total width of the 
waveguides is 450 nm with slot widths ranging from 60 nm to 120 nm. The slot 
waveguides are transitioned to nanotaper devices clad in silicon oxide to increase 
the coupling efficiency. The laser source is a tunable 1,550-nm laser that runs to a 
tapered lensed fibre, using the same set-up previously used for our SU-8 waveguide 
experiments. 

The particle solution consists of suspended fluorescent polystyrene nanoparticles 
75 nm and 100 nm in diameter (Duke Scientific) with refractive index 
n = 1.574 in a 100 mM phosphate buffer solution. The particles have a dispersity 
in diameter of about 10%. The high ionic concentration of the buffer solution 
suppresses electrostatic interactions in the system and maintains a constant pH 
during experiments. We add 1%-by-volume Triton X-100 non-ionic surfactant 
to the particle solution to prevent aggregation of the nanoparticles and to limit 
adhesion of particles to the surface of the devices and PDMS microchannels. 

The experiments use devices that are bonded to a PDMS microchannel 
100 µm wide and 5 µm tall. The fluidics are driven using an adjustable air-pressure 
system designed to maintain a constant pressure to the device. The 
power output of the fibre during the trapping experiments was set from 250 
to 300 mW. Particle trapping was confirmed by counting immobilized particles 
and counting the number of released particles. We determined particle velocity 
measurements and particle trapping times by using the ImageJ particle tracking 
software. Images of the experiments were captured at a rate of 55 ms per frame 
using a SensiCam CCD camera. 

As mentioned in the main text, we carried out experiments demonstrating 
λ-DNA trapping using the technique described above, with the exception of the 
smaller 60-nm-wide slot waveguide. The λ-DNA molecules (New England 
BioLabs) were stained with YOYO-1 intercalating dye (Molecular Probes) so 
that they could be observed using traditional fluorescence microscopy. The 
buffer consisted of 10 mM Tris Base (J.T. Baker), 1 mM EDTA (Fisher), and 
10 mM sodium chloride (Mallinckrodt) at a pH of 7.8. Poly(n-vinylpyrrolidone) 
(PVP, Sigma) 2% (by weight) was added to reduce unspecific binding of DNA to 
channel walls. 

Seismic evidence for overpressured subducted 
oceanic crust and megathrust fault sealing 


Pascal Audet, Michael G. Bostock, Nikolas I. Christensen & Simon M. Peacock 


Water and hydrous minerals play a key part in geodynamic processes 
at subduction zones’ * by weakening the plate boundary, aiding slip 
and permitting subduction—and indeed plate tectonics—to occur’. 
The seismological signature of water within the forearc mantle 
wedge is evident in anomalies with low seismic shear velocity 
marking serpentinization*’. However, seismological observations 
bearing on the presence of water within the subducting plate itself 
are less well documented. Here we use converted teleseismic waves to 
obtain observations of anomalously high Poisson’s ratios within the 
subducted oceanic crust from the Cascadia continental margin to 
its intersection with forearc mantle. On the basis of pressure, 
temperature and compositional considerations, the elevated 
Poisson’s ratios indicate that water is pervasively present in fluid 
form at pore pressures near lithostatic values. Combined with obser- 
vations of a strong negative velocity contrast at the top of the oceanic 
crust, our results imply that the megathrust is a low-permeability 
boundary. The transition from a low- to high-permeability plate 
interface downdip into the mantle wedge is explained by hydrofrac- 
turing of the seal by volume changes across the interface caused by 
the onset of crustal eclogitization and mantle serpentinization. These 
results may have important implications for our understanding of 
seismogenesis, subduction zone structure and the mechanism of 
episodic tremor and slip. 

In recent years, it has become increasingly apparent that large 
quantities of water (H₂O) are stored in the mantle wedge as the 
serpentine mineral antigorite. Compelling evidence for the presence 
of antigorite comes from seismic observations that reveal the pres- 
ence of highly resolved, low-shear-velocity anomalies in both warm 
(young slab, for example Cascadia®) and cold (old slab, for example 
northeast Japan’) subduction zone environments in locations pre- 
dicted by thermal and petrological models’. Water in the mantle 
wedge most probably originates from the dehydration of hydrous 
minerals within subducting oceanic crust and mantle, which will 
occur at shallower depths (<50 km) in warmer environments. 
Hydrous minerals form within oceanic lithosphere (1-2% H₂O in 
the crust) as a result of hydrothermal circulation and alteration at 
spreading ridges, along fracture zones, and in the region of the trench 
and outer rise. There is little constraint, however, on the in-situ 
abundance and distribution of H₂O in the fluid and hydrous minerals 
within subducting oceanic crust, even though this knowledge is 
likely to be important in assessing the strength of the plate interface 
and nature of seismogenesis. Recent studies of episodic tremor and 
slip (ETS) in both Cascadia and Japan point to an origin that 
involves, either directly or indirectly, fluids near the plate boundary, 
downdip of the locked zone but updip of its intersection with forearc 
mantle*"°. Characterizing the physical state of subducted oceanic 
crust and distribution of fluids therein may help in understanding 
the temporal and spatial occurrence of ETS events. 


We use observations of converted teleseismic waves (or receiver 
functions) recorded over a POLARIS portable array of seismometers 
across Vancouver Island at spacings of approximately 10 km (Fig. 1). 
All receiver functions from this experiment are plotted in raw form 
from west-southwest to east-northeast according to station position 
and, for individual stations, according to their sampling points 
(Fig. 2a). These data are sensitive to structures with scale lengths of 
1-10 km and are dominated across the entire profile by the signature 
of an eastward-dipping low-velocity zone. This signature includes 
three sets of oppositely polarized pulses comprising forward-scattered 
P-to-S (Ps, 3-4 s at station PGC) conversions, and back-scattered 
P-to-S (Pps, 13-15 s at PGC) and S-to-S (Pss, 17-20 s at 
PGC) conversions afforded by reflection of the teleseismic wavefield 
at the Earth’s free surface. These signals coincide with prominent, 
dipping seismic reflectors and have been previously interpreted to 
represent oceanic crust of the subducting Juan de Fuca plate’, 
consistent with its expression in studies further south beneath 
Oregon’ and worldwide*'*”, 

Here we re-examine the signature of the different scattered modes 
in more detail. It is widely appreciated'**> that the timing (relative to 
the incident P-wave arrival, time 0 in Fig. 2a) of the scattered modes 
from a given interface can be used to constrain both depth and average 

Figure 1 | Geographical map of northern Cascadia subduction zone. 
Location of the three-component, broadband seismic stations that recorded 
data used in this study are shown as inverted red triangles. The deformation 
front is indicated by the thin solid black line. Note that the array is oriented 
approximately perpendicular to the strike of subduction, which trends 
N45°W. 

Figure 2 | Radial receiver function results beneath northern Cascadia. 

a, Receiver function results for all data filtered between 0.05 and 0.5 Hz used 
in the analysis sorted by station position along the array and, for each 
station, by back-azimuth of incident wavefield. Amplitudes are relative to 
incident P-wave. Red and blue colours correspond to velocity increase and 
decrease with depth, respectively. Forward- (Ps) and back-scattered (Pps and 
Pss) phases show oppositely polarized pulses that are manifest of waves 
converted from the top and bottom boundaries of a dipping, low-velocity 
layer, indicated by blue and red labels. b, Estimated Vp/Vs and Poisson’s 
ratios for the low-velocity layer. Errors are standard deviations from 
bootstrap estimates. Elevated values (~0.4) are interpreted as near-lithostatic 
pore-fluid pressures. c, S-velocity perturbations relative to a one-dimensional 
reference model (V;*) using a Born approximation for radial Ps 
phases. Error bars are one standard deviation. Positive (red) and negative 
(blue) velocity contrasts from the bottom and top, respectively, of the 
low-velocity layer are of similar amplitude. 


P-to-S velocity ratio (Vp/Vs) of the overlying column. It can also be 
shown (see Supplementary Information) that relative times between 
two arrivals from distinct boundaries within the lithospheric column 
for each set of scattered modes can be used similarly to characterize the 
interval properties between the two boundaries, largely independent 
of overlying structure, even for dipping layers. 

This recognition allows us to investigate the velocity structure 
(that is, Vp/Vs) of the downgoing oceanic crust. The key constraint 
is the ratio of the direct conversion delay time (ΔT_Ps, the difference in 
time between direct conversions from the top and bottom of the 
oceanic crust) to the corresponding back-scattered quantities 
(ΔT_Pps, ΔT_Pss). In the two extremes as Vp/Vs tends to 1 and ∞, 
ΔT_Ps will tend to 0 and ΔT_Pps (= ΔT_Pss/2), respectively. Care must be 
taken in measuring these quantities, because low-pass filtering at 
corner periods greater than twice the true separation between pulses 
will bias measurements of ΔT_Ps, ΔT_Pps and ΔT_Pss to greater values 
(see Supplementary Information). 
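
The way a delay-time ratio constrains Vp/Vs can be sketched for the simplest case of a flat layer at vertical incidence, where the layer thickness cancels out of the ratio. This is a simplifying assumption made for illustration only; the treatment of dipping layers and non-vertical rays is described in the Supplementary Information and is not reproduced here. The layer thickness and velocities below are hypothetical, and the conversion to Poisson's ratio uses the standard relation for an isotropic medium.

```python
def delay_times(thickness_km, vp, vs):
    """Forward model at vertical incidence: (dT_Ps, dT_Pps, dT_Pss) in seconds."""
    dt_ps = thickness_km * (1.0 / vs - 1.0 / vp)
    dt_pps = thickness_km * (1.0 / vs + 1.0 / vp)
    dt_pss = 2.0 * thickness_km / vs
    return dt_ps, dt_pps, dt_pss


def vp_vs_from_ratio(dt_ps, dt_pps):
    """Invert R = dT_Pps / dT_Ps = (k + 1) / (k - 1) for k = Vp/Vs."""
    r = dt_pps / dt_ps
    return (r + 1.0) / (r - 1.0)


def poisson_ratio(k):
    """Poisson's ratio of an isotropic medium from k = Vp/Vs."""
    return (k * k - 2.0) / (2.0 * (k * k - 1.0))


# Hypothetical layer: 4 km thick, Vp = 5.0 km/s, Vs = 2.0 km/s.
dt_ps, dt_pps, dt_pss = delay_times(4.0, 5.0, 2.0)
k = vp_vs_from_ratio(dt_ps, dt_pps)
sigma = poisson_ratio(k)
print(f"Vp/Vs = {k:.2f}, Poisson's ratio = {sigma:.3f}")
```

In this simplified geometry the two limits stated in the text follow directly: as Vp/Vs approaches 1 the forward-scattered delay vanishes, and as Vp/Vs grows without bound it approaches the Pps delay, which is half the Pss delay.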

Figure 2b shows our estimates of oceanic crustal Vp/Vs beneath 
each station of the array. These estimates are extremely high, so it is 
more practical to speak in terms of Poisson’s ratio (σ, for which there is 
an upper bound of 0.5 for inviscid fluids); the values are around 0.4. 
These values cannot be due to compositional effects (that is, mineralogy 
within the downgoing oceanic crust). The common oceanic 
crustal rocks basalt, diabase and gabbro have σ between 0.28 and 
0.29. At the prevailing temperatures and pressures (~1 GPa, 
~500 °C), these rocks will have been metamorphosed to upper amphibolite 
facies leading to a drop in σ to 0.26 or 0.27 (ref. 16). Antigorite, 
the high-temperature variety of serpentine, has a σ of 0.29 (ref. 17). 
Even the most extreme candidates, the low-pressure serpentine 
minerals chrysotile and lizardite, have σ below 0.37, and their presence 
can be further ruled out on the basis of pressure, temperature and 
compositional considerations'*. The only plausible explanation is that 
of high pore-fluid pressures. Widely documented high pore-fluid 
pressures existing at shallower levels in accretionary prisms are due 
primarily to porosity reduction but it is more likely that our observations 
indicate mineral dehydration. Unfortunately, a precise correspondence 
between σ and pore-fluid pressure is hampered by the 
lack of laboratory measurements at confining pressures corresponding 
to 20-40 km depth. Nonetheless, extrapolation of measurements 
made at lesser pressures shows that it is likely that the oceanic crust 
beneath Vancouver Island is overpressured (near lithostatic), a result 
with a range of important implications. 

Evidence for high pore-fluid pressure in the Nankai subduction 
zone has been presented recently’”’ but these studies, using different 
forms of travel-time tomography, have found less extreme values of 
Vp/Vs. Measurements of Vp/Vs derived from tomography are likely 
to be biased towards normal values through the blurring effects of 
regularization, simultaneous earthquake relocation and the inherent 
difficulties in imaging low-velocity zones with travel times*”’. 
Estimates of σ obtained using scattered waves are likely to be better 
resolved owing to their greater sensitivity to local velocity changes. 

Our documentation of high pore-fluid pressure implies that the 
plate interface probably represents a low-permeability boundary. 
Using Darcy’s law, we estimate the permeability of the interface to 
be ~5 × 10⁻²⁵ to ~5 × 10⁻²² m², assuming pore-fluid pressures 
equal to lithostatic pressures beneath the interface, a 0.65-GPa pressure 
drop across the interface corresponding to hydrostatic pore-fluid 
pressure above the interface at a depth of 35 km, a slab fluid 
production rate of 10⁻⁴ m³ (m² yr)⁻¹ (ref. 5), an interface thickness 
of 1 to 1,000 m, and a dynamic viscosity of 10⁻⁴ Pa s for H₂O at 500 °C 
and 1 GPa. These estimates are lower than measurements of basaltic 
oceanic basement permeabilities in the range 10-7’ to 10°'! m* 
made at length scales of 1 to 1,000 m (ref. 24). The significance of 
our estimate is reinforced by noting that the greatest permeabilities 
are found within the upper few hundred metres of oceanic crust and 
increase with scale length”*. The low permeability of the plate inter- 
face may reflect deformation-induced grain-size reduction” or the 
precipitation of minerals from migrating fluids**”’. 
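
The order of magnitude of the permeability estimate can be checked with a minimal sketch of Darcy's law rearranged for permeability, k = qμL/ΔP. The input values are our reading of the quantities quoted above (a fluid flux of 10⁻⁴ m³ per m² per year, a viscosity of 10⁻⁴ Pa s and a 0.65-GPa pressure drop) and should be treated as assumptions; the function and variable names are illustrative.

```python
SECONDS_PER_YEAR = 365.25 * 24 * 3600


def interface_permeability(flux_m_per_yr, viscosity_pa_s, thickness_m, pressure_drop_pa):
    """Darcy's law solved for permeability: k = q * mu * L / dP, in m^2."""
    flux_m_per_s = flux_m_per_yr / SECONDS_PER_YEAR
    return flux_m_per_s * viscosity_pa_s * thickness_m / pressure_drop_pa


# Assumed readings of the values quoted in the text.
flux = 1e-4              # m^3 of fluid per m^2 of interface per year
viscosity = 1e-4         # Pa s
pressure_drop = 0.65e9   # Pa

k_thin = interface_permeability(flux, viscosity, 1.0, pressure_drop)
k_thick = interface_permeability(flux, viscosity, 1000.0, pressure_drop)
print(f"permeability range: {k_thin:.1e} to {k_thick:.1e} m^2")
```

Because permeability scales linearly with the assumed interface thickness, the three orders of magnitude between the thin and thick cases come entirely from the 1 to 1,000 m thickness range.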

Further evidence supporting a low-permeability plate boundary at 
crustal levels stems from scattered wave amplitudes that are sensitive 
to velocity contrasts at interfaces (Fig. 2c). Amplitudes of arrivals for 
a given scattering mode originating from the bottom and top oceanic 
crustal boundaries are comparable, indicating similar S-velocity 
jumps of ±1 km s⁻¹, respectively. The lower boundary jump is consistent 
with that expected for the oceanic Moho which marks the 
juxtaposition of lower-velocity gabbro with higher-velocity mantle 
peridotite. The lower crust of the overriding plate is thought to be 
formed of dominantly mafic accreted terranes** and would thus be 
expected to afford little or no contrast in seismic velocity with oceanic 
crust. Prominent amplitudes of scattered waves from the upper 
boundary thus support a sharp contrast from low-velocity oceanic 
crust containing trapped fluids to higher velocity overriding crust 
characterized by much lower pore-fluid pressure. 

Low permeability of the plate boundary at depths less than 35 km 
contrasts with the seismic expression of forearc mantle serpentiniza- 
tion, which appears downdip at depths greater than 40 km in this 
region and suggests that H₂O is readily transported into the wedge 
(Fig. 3). A change in the nature of the plate boundary seal can be 
explained by the large (>10%) volume reduction and release of H₂O 
that accompanies eclogitization, the last significant metamorphic 
facies transition experienced by oceanic crust. Both this reaction and 
the still greater volume expansion accompanying serpentinization of 
the mantle wedge” could be expected to increase permeability near 
the plate boundary through fracture generation. This interpretation 
is consistent with the seismic signature in the cold subduction zone 

Figure 3 | Schematic interpretation of receiver function results. Updip 
limit of overpressured oceanic crust is unconstrained. Note vertical 
exaggeration. 


setting of northeast Japan, where a serpentinite layer is interpreted to 
form above subducting oceanic crust at onset of eclogitization near 
~90 km depth®. 

It is worth noting that in warm subduction zones, ETS tends to 
occur in the vicinity of the plate interface near but seaward of the 
wedge corner*”. Our documentation of high pore-fluid pressures in 
this region supports two groups of models that involve fluids: (1) ETS 
is triggered by hydrofracturing at the plate interface’; and (2) ETS 
occurs where high pore-fluid pressures extend the region of con- 
ditionally stable slip’*'. A downdip change in permeability across 
the plate interface could be explained by hydrofracturing of the seal 
where hydraulic conductivities become sufficiently high near the onset 
of eclogitization. Moreover, it might represent an important factor in 
altering the mode of slip along the thrust within the frictional stability 
transition zone. Our model suggests the possibility of linking the 
recurrence of ETS at regular time intervals to periodic cycles of steady 
pore-fluid pressure build-up from dehydration of subducted oceanic 
crust, fluid release from fracturing of the interface during ETS, and 
subsequent precipitation sealing of the plate boundary. 

Finally, our documentation of high pore-fluid pressure in the sub- 
ducting plate has important implications for the interpretation of 
tomographic velocity images, and the position of the plate boundary 
interface. High pore-fluid pressures result not only in elevated o (and 
Vp/Vs) but also in depressed absolute velocities (−20% and −30% for 
P-waves and S-waves, respectively, in the measurements of 
Christensen'®). Consequently, identification of a tomographic iso- 
velocity contour that does not account for this effect, as proxy for 
the oceanic Moho** will yield interface depths that are significantly 
overestimated’. 
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Indirect reciprocity provides only a narrow margin of 
efficiency for costly punishment 


Hisashi Ohtsuki, Yoh Iwasa & Martin A. Nowak 


Indirect reciprocity is a key mechanism for the evolution of 
human cooperation. Our behaviour towards other people depends 
not only on what they have done to us but also on what they have 
done to others. Indirect reciprocity works through reputation. 
The standard model of indirect reciprocity offers a binary choice: 
people can either cooperate or defect. Cooperation implies a cost 
for the donor and a benefit for the recipient. Defection has no cost 
and yields no benefit. Currently there is considerable interest in 
studying the effect of costly (or altruistic) punishment on human 
behaviour. Punishment implies a cost for the punished person. 
Costly punishment means that the punisher also pays a cost. It has 
been suggested that costly punishment between individuals can 
promote cooperation. Here we study the role of costly punishment 
in an explicit model of indirect reciprocity. We analyse all social 
norms, which depend on the action of the donor and the reputation 
of the recipient. We allow errors in assigning reputation and study 
gossip as a mechanism for establishing coherence. We characterize 
all strategies that allow the evolutionary stability of cooperation. 
Some of those strategies use costly punishment; others do not. We 
find that punishment strategies typically reduce the average payoff 
of the population. Consequently, there is only a small parameter 
region where costly punishment leads to an efficient equilibrium. In 
most cases the population does better by not using costly punish- 
ment. The efficient strategy for indirect reciprocity is to withhold 
help for defectors rather than punishing them. 

Human societies are organized around cooperative interactions. 
But why would natural selection equip selfish individuals with altru- 
istic tendencies? This question has fascinated evolutionary biologists 
for decades. One answer is given in terms of direct reciprocity. 
There are repeated encounters between the same two individuals: I 
help you, and you help me. More recently, indirect reciprocity has 
emerged as a more general model: I help you, and somebody helps 
me. Indirect reciprocity is based on reputation. People monitor the 
social interactions within their group. Helping others establishes the 
reputation of being a helpful individual. Natural selection can favour 
strategies that help those who have helped others. The conse- 
quences for widespread cooperation are enormous. Direct recipro- 
city is like an economy based on the exchange of goods, whereas 
indirect reciprocity resembles the invention of money. The money 
that feeds the engines of indirect reciprocity is reputation. For direct 
reciprocity, my strategy depends on what you have done to me; for 
indirect reciprocity, my strategy also depends on what you have done 
to others. Direct and indirect reciprocity are mechanisms for the 
evolution of cooperation. 

Punishment refers to an action that implies a cost for the punished 
person. Costly punishment means that the punisher also pays a cost 
for exercising punishment. In certain experimental situations costly 
punishment has been called ‘altruistic punishment’, because the 
punishers cannot expect any material gain from their action. In 
reality, however, most punishment actions among humans are assoc- 
iated with the expectation of a delayed material gain; they are there- 
fore not altruistic. 

The suggested idea for the evolution of cooperation is that people 
might be more willing to cooperate under the threat of punishment. 
However, we note that costly punishment is not a separate mechanism 
for the evolution of cooperation but a form of direct or indirect 
reciprocity. If I punish you because you have defected with me, then 
I use direct reciprocity. If I punish you because you have defected with 
others, then indirect reciprocity is at work. In the setting of direct 
reciprocity, punishment is a form of retaliation. For indirect reci- 
procity, punishment works through reputation and also includes 
third-party actions, which means that observers of an interaction 
are willing to punish defectors at a cost to themselves. Therefore, 
any discussion of the evolution of costly punishment brings us imme- 
diately into the framework of direct or indirect reciprocity. 

In general, the reputation score could be a continuous variable, 
but here we consider a simple model with binary reputation. People 
have either a good reputation (G) or a bad reputation (B). At times, 
two random players are chosen from the population, one in the role 
of donor, the other in the role of recipient. The donor can either 
cooperate (C), defect (D) or punish (P). Cooperation means the 
donor pays a cost c, and the recipient gets a benefit b. Punishment 
implies that the donor pays a cost α and the recipient incurs a cost β. 
For defection there is no cost and no benefit. 
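
As a minimal sketch of the payoff structure just described, the following Python function returns the payoff change of the donor and the recipient for a single interaction. The function name `interaction_payoffs` and the parameter names `alpha` and `beta` (cost and effect of punishment) are illustrative choices and do not come from the paper.

```python
def interaction_payoffs(action, b, c, alpha, beta):
    """Return (donor_payoff, recipient_payoff) for one interaction.

    action: 'C' (cooperate), 'D' (defect) or 'P' (punish).
    b, c: benefit and cost of cooperation.
    alpha, beta: cost paid by the punisher and cost incurred by the punished.
    """
    if action == "C":
        return (-c, b)            # donor pays c, recipient gains b
    if action == "P":
        return (-alpha, -beta)    # both parties lose
    if action == "D":
        return (0, 0)             # no cost, no benefit
    raise ValueError(f"unknown action: {action!r}")
```

For example, `interaction_payoffs("P", 2, 1, 0.5, 2)` gives the punisher a payoff of -0.5 and the punished recipient a payoff of -2.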

The interaction between the donor and the recipient is observed by 
the other members of the population (Fig. 1). The reputation of the 
donor is updated according to a social norm. First-order assessment 
depends only on the action of the donor; for example, cooperation 
leads to a good reputation, whereas defection leads to a bad repu- 
tation. Second-order assessment depends both on the action of the 
donor and the reputation of the recipient: for example, it could be 
deemed ‘good’ to cooperate with a good recipient but ‘bad’ to coop- 
erate with a bad recipient. Here we study social norms that use second- 
order assessment. The donor has three possible moves (C, D or P) and 
the recipient has one of two reputations (G or B). There are therefore 
six combinations and 2^6 = 64 social norms with second-order assess- 
ment. All detailed calculations are shown in the Supplementary 
Information. 

Any interaction leads to either a good or a bad reputation for the 
donor. We assume that this process of reputation updating is subject 
to errors. There may be wrong observations or the spread of false 
rumours. With probability μ an incorrect reputation is assigned and 
adopted by all. In the simplest model, everyone has the same opinion 
of everyone else. There are no private lists of reputation. Triggering a 
wrong reputation affects everyone equally. The parameter q = 1 − 2μ 
quantifies the ability of the population to distinguish between good 
Figure 1 (schematic labels): Action: cooperate, defect or punish. Donor. Recipient has a good or bad reputation. Observers update the reputation of the donor with errors in perception and gossip. 

Figure 1 | Indirect reciprocity with costly punishment. The donor chooses 
one of three actions: cooperate, defect or punish. The recipient has a binary 
reputation, which is either ‘good’ or ‘bad’. The donor’s choice depends on 
the recipient’s reputation. The donor’s action is observed by other members 
of the population, who update the donor’s reputation according to a social 
norm, which is shared by all. The donor is assigned an incorrect reputation 
with probability μ. The ‘social resolution’, q = 1 − 2μ, is a key parameter of 
indirect reciprocity: it defines the ability to distinguish between good and 
bad. 


and bad. We call q the ‘social resolution’. If μ = 1/2 then reputation is 
assigned at random, and there is no ability to distinguish between 
good and bad (q = 0). 

Games of indirect reciprocity contain social norms and action 
rules. The action rule specifies for the donor whether to cooperate, 
defect or punish a recipient who is either good or bad. For example, 
the action rule CD prescribes cooperation with a good recipient and 
defection with a bad recipient; this rule does not use costly punish- 
ment. In contrast, the action rule CP prescribes cooperation with 
good recipients and punishment of bad recipients. The action rules 
CC, DD and PP encode, respectively, unconditional cooperation, 
defection and punishment. In total, there are nine possible action 
rules. 
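
The counts quoted in the text (nine action rules, 64 second-order social norms) can be checked by brute-force enumeration. The sketch below is only an illustration; the names `ACTION_RULES`, `SITUATIONS` and `SOCIAL_NORMS` and the dictionary encoding of a norm are assumptions made here, not notation from the paper.

```python
from itertools import product

ACTIONS = ("C", "D", "P")     # cooperate, defect, punish
REPUTATIONS = ("G", "B")      # good, bad

# An action rule maps the recipient's reputation to an action. It is written
# as a two-letter string: action towards a good recipient, then towards a bad one.
ACTION_RULES = ["".join(pair) for pair in product(ACTIONS, repeat=2)]

# A second-order social norm assigns the donor a new reputation for every
# (donor action, recipient reputation) combination.
SITUATIONS = list(product(ACTIONS, REPUTATIONS))
SOCIAL_NORMS = [dict(zip(SITUATIONS, values))
                for values in product(REPUTATIONS, repeat=len(SITUATIONS))]

print(len(ACTION_RULES), len(SOCIAL_NORMS))  # 9 64
```

Pairing every action rule with every norm gives the 9 × 64 = 576 combinations that have to be examined.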

For each of the 64 social norms we study the competition of all nine 
action rules. We assume that everyone in the population has the same 
social norm, and we evaluate whether this norm allows the evolu- 
tionary stability of action rules that specify cooperation with good 
recipients. There are only two candidates for such action rules, CD 
and CP, because CC is not stable against invasion by defectors (DD). 
Figure 2 shows all social norms that allow the evolution of coopera- 
tion. The action rule DD is evolutionarily stable for any social norm. 

Social norms that stabilize the CD action rule have the following 
properties: (1) cooperation with a good recipient leads to a good 
reputation; (2) defection against a good recipient leads to a bad 
reputation; and (3) defection against a bad recipient leads to a good 
reputation. The three remaining positions in the norm can be either 
G or B. If the cost of cooperation is greater than the cost of punish- 
ment (c > α), then punishing a good recipient must lead to a bad 
reputation; otherwise a donor can keep a good reputation by using 
the cheaper punishment option instead of the more expensive 
cooperation move. 

Social norms that stabilize the CP action rule have the following 
properties: first, cooperation with a good recipient leads to a good 
Figure 2 | Social norms of cooperation. We have determined all social 
norms that allow the evolutionary stability of action rules prescribing 
cooperation with good recipients. There are two such action rules, CD and 
CP. The former ‘punishes’ bad recipients by defection (D); the latter uses 
costly punishment (P). In a the cost of cooperation exceeds the cost of 
punishment; in b the cost of cooperation is less than the cost of punishment. 
There is an intuitive summary of all successful social norms: following the 
action rule maintains a good reputation; deviating from it can lead to a good 
or bad reputation; a deviation that is less costly than the prescribed action 
must lead to a bad reputation. An asterisk denotes G or B. The detailed 
parameter requirements for evolutionary stability are given in the text and in 
the Supplementary Information. 


reputation; second, defection always leads to a bad reputation; and 
third, punishing a bad recipient leads to a good reputation. 
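
The listed properties can be read as filters over the 64 second-order norms. The following sketch encodes only the properties stated in the text for CD and CP; it is not a substitute for the full stability analysis, and the function names and the dictionary encoding of a norm are hypothetical.

```python
from itertools import product

ACTIONS = ("C", "D", "P")
REPUTATIONS = ("G", "B")
SITUATIONS = list(product(ACTIONS, REPUTATIONS))

def all_norms():
    """Yield all 64 second-order norms as dicts {(action, recipient): 'G' or 'B'}."""
    for values in product(REPUTATIONS, repeat=len(SITUATIONS)):
        yield dict(zip(SITUATIONS, values))

def has_cd_properties(norm, c, alpha):
    """Properties listed in the text for norms that stabilize CD."""
    ok = (norm[("C", "G")] == "G"
          and norm[("D", "G")] == "B"
          and norm[("D", "B")] == "G")
    if c > alpha:
        # Punishing a good recipient is cheaper than cooperating,
        # so it must not preserve a good reputation.
        ok = ok and norm[("P", "G")] == "B"
    return ok

def has_cp_properties(norm):
    """The three properties listed in the text for norms that stabilize CP."""
    return (norm[("C", "G")] == "G"
            and norm[("D", "G")] == "B"
            and norm[("D", "B")] == "B"
            and norm[("P", "B")] == "G")
```

With these filters alone, eight norms pass the CD test when c is not greater than α and four pass when c > α; four norms pass the CP test. The intuitive summary in the Figure 2 caption suggests further cost-dependent restrictions, which are not encoded here.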

CD action rules are evolutionarily stable if the social resolution 
exceeds the cost-to-benefit ratio (q > c/b). In contrast, CP action 
rules are evolutionarily stable if q > max{c, α}/(b + β). Note that 
costly punishment can stabilize cooperation even if q < c/b. Thus, 
costly punishment can in principle extend the stability range of 
cooperation. DD action rules are always evolutionarily stable. 
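
The stability conditions quoted above translate directly into a small check. This is a sketch under the assumption that a suitable social norm is in place for each rule; the function name `stable_rules` and the parameter names `alpha` and `beta` are illustrative.

```python
def stable_rules(b, c, alpha, beta, q):
    """Return the action rules among CD, CP and DD that meet the
    stability conditions quoted in the text."""
    stable = {"DD"}                          # DD is always evolutionarily stable
    if q > c / b:                            # social resolution beats cost-to-benefit ratio
        stable.add("CD")
    if q > max(c, alpha) / (b + beta):       # condition for CP
        stable.add("CP")
    return stable
```

For b = 2, c = 1, α = 1/2 and β = 2, the call `stable_rules(2, 1, 0.5, 2, 0.4)` returns CP and DD but not CD, which is the case where punishment extends the stability range of cooperation.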

We have performed computer simulations in heterogeneous 
populations of finite size to test the validity of our analytical calcula- 
tions. We find that the CD and CP action rules are stable against 
invasion by other action rules under the appropriate social norms 
and given the right parameter region. In the simplest simulations 
everyone has the same information about the reputation of others. 
In the extended simulations we drop this assumption. Now there are 
individual errors in assigning reputation. Consequently everyone has 
a private list of the reputation of others. These errors can destroy 
indirect reciprocity unless there is a mechanism for re-establishing 
coherence. Gossip is such a mechanism. We assume that individuals 
talk to each other and sample each other’s opinions (as in a ‘voter 
model’). If there are enough communication events, then we observe 
the evolutionary stability of our strategies as predicted. We have also 
studied errors resulting in execution of the wrong action (‘trembling 
hand’) or recalling an incorrect reputation (‘fuzzy mind’). Our 
results are robust as long as these errors are not too frequent. All 
simulations are described in the Supplementary Information. 

For some parameter regions, multiple action rules are evolutiona- 
rily stable. We therefore ask the following question: for all possible 
parameter regions, which of the three action rules CD, CP and DD are 
stable, and which one is the most efficient in the sense of leading to 
the highest average payoff at equilibrium? We obtain the following 
answer. (1) If q > c/b, then CD is most efficient. (2) If 
c/b > q > c/(b + β) then CP is stable and more efficient than DD, if 
the following two conditions hold: 

Otherwise DD is more efficient than CP. If b < c, then DD is always 
more efficient than CP. (3) If c/(b + β) > q, then only DD is evolu- 
tionarily stable. 

Thus, if the accuracy of assigning the correct reputation, q, is too 
low, then only DD is efficient. If q is sufficiently large, then CD is 
efficient. For intermediate values of q there can be a small window 
where CP is efficient. However, the existence of this parameter region 
depends on whether the key parameters b, c, α and β fulfil the con- 
straints given by equation (1). Let us consider a numerical example. If 
b = 2, c = 1, α = 1/2 and β = 2, then CD is efficient for q > 1/2, 
whereas CP is efficient for 1/2 > q > 3/7 and DD is efficient for 
3/7 > q. If we increase the effect of punishment to β = 5/2 (or larger), 
there is no region left where CP is efficient. Intuitively, if CD is 
evolutionarily stable, it is always the most efficient equilibrium. If 
it is not stable, the remaining parameter region where CP is stable and 
more efficient than DD is very small or non-existent. Figure 3 illus- 
trates the narrow margin of efficiency of costly punishment. 
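
The numerical example can be reproduced with a short sketch. The payoff expressions used here are an assumption and are not given in this section: a player following CD or CP under a stabilizing norm is taken to be good with probability 1 − μ = (1 + q)/2, which yields average payoffs of g(b − c) for CD, g(b − c) − (1 − g)(α + β) for CP, and 0 for DD. These expressions reproduce the thresholds 1/2 and 3/7 quoted in the example, but the paper's own derivation is in its Supplementary Information. The function name `most_efficient_rule` is hypothetical.

```python
def most_efficient_rule(b, c, alpha, beta, q):
    """Pick the stable action rule with the highest average payoff.

    Assumption: the fraction of good players at a cooperative equilibrium
    is g = (1 + q) / 2, so the per-interaction average payoffs are
        CD: g * (b - c)
        CP: g * (b - c) - (1 - g) * (alpha + beta)
        DD: 0
    """
    g = (1 + q) / 2
    payoffs = {"DD": 0.0}                        # DD is always stable
    if q > c / b:                                # CD stability condition
        payoffs["CD"] = g * (b - c)
    if q > max(c, alpha) / (b + beta):           # CP stability condition
        payoffs["CP"] = g * (b - c) - (1 - g) * (alpha + beta)
    # Ties are resolved in favour of the rule inserted first.
    return max(payoffs, key=payoffs.get)
```

With b = 2, c = 1, α = 1/2 and β = 2, the function returns CD at q = 0.6, CP at q = 0.45 and DD at q = 0.4. Raising β to 5/2 makes it return DD at q = 0.45, in line with the statement that no CP region remains.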

These considerations of efficiency do not imply that all populations 
will evolve towards punishment-free action rules. It is possible that a 
population is stuck at an inefficient equilibrium for a long time. A 
model with contingent movement allows us to study the competition 
Figure 3 | The marginal efficiency of costly punishment. Projections of the 
five-dimensional (b, c, α, β and q) parameter space onto various two- 
dimensional planes. The parameters b and c denote the benefit and cost of 
cooperation. The parameters α and β denote the cost and effect of 
punishment. The social resolution of the system is given by q, the probability 
of distinguishing between good and bad in a world where errors in 
assignment of reputation are possible. The symbols CD, CP and DD 
represent the region where the corresponding action rule is the most efficient 
equilibrium, which means the evolutionarily stable strategy with the highest 
average payoff. CD means cooperation with good recipients and defection 
with bad ones. CP means cooperation with good recipients and punishment 
of bad ones. DD is unconditional defection. Costly punishment is an 
efficient equilibrium only for a very constrained parameter region (shown in 
yellow). a, Projections on the β–q plane for b = 3 and α = 0.8 or 1.2. 
b, Projections on the b–q plane for β = 3 and α = 0.8 or 1.2. c, Projections on 
the α–β plane for b = 1.5 and q = 0.3, 0.6 or 0.9. We always use c = 1. 
of different social norms. We examine a simple model in which two 
groups have two different social norms. One norm stabilizes CD, 
whereas the other norm stabilizes CP. People interact only within their 
own group, but sometimes they compare their payoff with individuals 
from the other group. People might move to the other group and 
adopt its social norm if they find that its members have a higher payoff. 
We observe rapid selection of the efficient equilibrium (see 
Supplementary Information). 

In an experimental study, the observers of a Prisoner’s Dilemma 
game between two other people sometimes punish defectors at a cost 
to themselves. This behaviour is a form of indirect reciprocity. In 
another experiment, a public goods game is followed by one round 
of punishment and then by one round of cooperation or defection. 
This setup is not directly comparable with our model, but the obser- 
vation is that adding the third round reduces the amount of punish- 
ment that is being used in the second round. This particular finding is 
in agreement with our result: other possibilities of indirect recipro- 
city reduce the amount of costly punishment. In the context of our 
theory it would be important to extend both experiments to permit 
reputation-building over multiple rounds of interaction and a choice 
between cooperation, defection and costly punishment in every 
round. We predict that such an experiment will show that costly 
punishment is an inefficient behaviour for most parameter regions. 

We have studied the effect of costly punishment in an explicit 
model of indirect reciprocity. We have analysed all social norms that 
use binary reputation and second-order assessment. We find that 
both CD and CP action rules can stabilize cooperation. These rules 
reward good recipients with cooperation and ‘punish’ bad ones with 
either defection (CD) or costly punishment (CP). If both CD and CP 
action rules are evolutionarily stable, the use of costly punishment 
leads to a lower equilibrium payoff and is therefore inefficient. It is 
even possible that costly punishment yields a lower payoff than all- 
out defection (DD). Costly punishment maximizes the group average 
payoff for only a very limited parameter region. This narrow margin 
of efficiency requires fine-tuning of the key parameters. If the social 
resolution exceeds the cost-to-benefit ratio (q > c/b), CD rules are 
always more efficient than CP rules. The evolution of improved 
mechanisms of indirect reciprocity therefore leads to societies in 
which costly punishment between individuals is not an efficient 
behaviour for promoting cooperation. 


METHODS SUMMARY 


An action rule s is formulated as a mapping from {G, B} (the recipient’s repu- 
tation) to {C, D, P} (the prescribed action). A social norm n is a mapping from 
the product of {C, D, P} (the donor’s action) and {G, B} (the recipient’s repu- 
tation) to {G, B} (the donor’s new reputation). We search for the combination of 
an action rule s and a social norm n that satisfies the following two properties: 
first, the monomorphic population in which all players adopt s and n achieves 
full cooperation in the absence of errors; and second, the action rule s is evolu- 
tionarily stable under the social norm n. We check these two criteria for each of 
all 9 × 64 = 576 possible combinations of action rule and social norm (s, n). 
From the first criterion, action rule s must use cooperation (C). Because of the 
symmetry in the binary labels G and B we can assume without loss of generality 
that the action rule prescribes cooperation to good recipients; that is, s(G) = C. 
To study the evolutionary stability of the action rule s, we use dynamic optimiz- 
ation. We assume that the social norm is n and that all players except the focal 
player adopt action rule s. Under this assumption we calculate the best-response 
action rule s* of the focal player. If s* exists uniquely and matches s, then s is 
evolutionarily stable under n. Coexistence of different action rules is not within 
the scope of our analysis. See the Supplementary Information for further details. 
Emergence of complex cell properties by learning to 
generalize in natural scenes 


Yan Karklin & Michael S. Lewicki 


A fundamental function of the visual system is to encode the build- 
ing blocks of natural scenes—edges, textures and shapes—that sub- 
serve visual tasks such as object recognition and scene 
understanding. Essential to this process is the formation of abstract 
representations that generalize from specific instances of visual 
input. A common view holds that neurons in the early visual system 
signal conjunctions of image features, but how these produce 
invariant representations is poorly understood. Here we propose that 
to generalize over similar images, higher-level visual neurons encode 
statistical variations that characterize local image regions. We pre- 
sent a model in which neural activity encodes the probability distri- 
bution most consistent with a given image. Trained on natural 
images, the model generalizes by learning a compact set of dictionary 
elements for image distributions typically encountered in natural 
scenes. Model neurons show a diverse range of properties observed 
in cortical cells. These results provide a new functional explanation 
for nonlinear effects in complex cells and offer insight into coding 
strategies in primary visual cortex (V1) and higher visual areas. 

As we scan across a complex natural scene, fixations at multiple 
locations (for example, on the trunk of a tree or along its edge) 
produce a coherent percept of the underlying structure (the bark 
texture or the contour of the edge), even though individual images 
collected at the retina are inherently highly variable. Figure 1 illus- 
trates the problem our brain solves so effortlessly: perceptually dis- 
tinct image regions produce response patterns that are highly 
overlapping and cannot be easily distinguished using low-level, linear 
representations. What sort of computations are required to achieve 
generalization across natural stimuli? 

Early visual neurons are typically described as linear feature detec- 
tors. Models developed around this idea can accurately capture the 
behaviour of neurons from the retina to simple cells in the cortex 
but, as the examples in Fig. 1 illustrate, neither individual features nor 
linear transformations can reliably discriminate images of one struc- 
ture from another. More abstract features are presumably computed 
in later stages of the visual system, but our knowledge of processing 
by these neurons is limited. In V1, complex cells respond to an edge 
over a range of positions, but classical models of these cells fail to 
explain a number of nonlinear effects, such as surround suppression 
and cross-orientation inhibition. More importantly, there is no 
functional explanation for the role of these behaviours in the percep- 
tion of natural scenes. In higher visual areas such as V2 and V4, 
neurons are more invariant to image properties such as position 
and scale and might be encoding shape or texture. For these 
neurons to generalize effectively, the neural circuitry must generate a 
representation that is similar across the wide distribution of images of 
a given type (for example, a texture or contour) yet distinct across the 
much larger distribution of all other images. 


Previous theoretical work has shown that neurons in the primary 
visual cortex form an efficient code adapted to the statistics of natural 
images, but this says nothing about how neurons generalize across 


Figure 1 | Statistical patterns distinguish local regions of natural scenes. 
a, A natural scene with four distinct regions outlined (image courtesy of 
E. Doi). b, The scatter plot shows the joint output of a pair of linear feature 
detectors (oriented Gabor filters) for 20 × 20 image patches sampled from 
the four regions. The outputs from different regions are highly overlapping, 
indicating that linear features provide no means to distinguish between the 
regions. c, Each column shows the joint output of a different pair of linear 
feature detectors sampled from the regions containing the tree bark or the 
tree edge (the first column corresponds to features in b). The correlations in 
each panel can be described by a Gaussian distribution and its covariance 
(ellipses). The differences in the distributions between the rows reveal 
characteristic patterns in correlations, which become even more prominent 
as projections onto more features are considered. These patterns can be used 
to generalize within regions while still distinguishing among them. As an 
example, we highlight two patches in each region, shown by the circle and 
triangle in each panel. Although the pairs of images are visibly quite 
different, each image is consistent with the distribution of the local image 
region. By contrasting the distributions across multiple dimensions, it is 
possible to infer image type for single patches, even if the patches have 
similar projections along some image features. 
the intrinsic variability of scene elements. Here we extend the effi- 
cient coding approach and propose that an important aspect of visual 
computation is to represent the myriad statistical distributions 
that characterize local image regions. Rather than coding the pixel 
intensities of a patch of texture or edge, neurons in later stages encode 
the image distribution (that is, the range and pattern of variability of 
pixel intensities or image features) that is most consistent with the 
input image. This allows the neural representation to generalize 
across individual fixations and convey more abstract properties of 
the image. We demonstrate that a model designed around this com- 
putational goal and optimized for natural scenes explains nonlinear 
properties of complex cells and neurons in higher visual areas, 
thereby providing a new functional interpretation for these effects. 

Fundamentally, generalization is the identification of common 
characteristics of a class from specific instances. The goal of the 
proposed model is to learn the statistical distributions that character- 
ize local image regions, such as those in Fig. 1, and identify them from 
individual image patches. What statistical regularities are relevant for 
this task? As the examples in Fig. 1 suggest, the distributions of 
perceptually similar images show consistent patterns in the degree 
of variation along some dimensions, as well as in the strength of 
correlations (and anti-correlations) among different feature dimen- 
sions. Although these patterns appear subtle when projected onto 
two dimensions, as in the examples, the full multivariate distribution, 
consisting of hundreds of dimensions, produces prominent statistical 
signatures that can be exploited by the visual system. 
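
To illustrate how correlation patterns can identify the source of a single patch, the sketch below compares the likelihood of one two-dimensional feature response under two zero-mean Gaussian distributions that differ only in their covariance. The covariance values and the labels `bark` and `edge` in the usage note are made up for illustration and are not measurements from the paper.

```python
import math

def gaussian_loglik_2d(x, cov):
    """Log-density of a zero-mean 2-D Gaussian with covariance `cov` at point x.

    cov is ((a, b), (b, d)) and must be positive definite.
    """
    (a, b), (_, d) = cov
    det = a * d - b * b
    if det <= 0 or a <= 0:
        raise ValueError("covariance must be positive definite")
    x1, x2 = x
    # Quadratic form x^T cov^{-1} x using the closed-form 2x2 inverse.
    quad = (d * x1 * x1 - 2 * b * x1 * x2 + a * x2 * x2) / det
    return -0.5 * (quad + math.log(det)) - math.log(2 * math.pi)

def most_consistent(x, named_covs):
    """Return the name of the distribution under which x is most likely."""
    return max(named_covs, key=lambda name: gaussian_loglik_2d(x, named_covs[name]))
```

With `covs = {"bark": ((1, 0.8), (0.8, 1)), "edge": ((1, -0.8), (-0.8, 1))}`, the response `(1, 1)` is assigned to `bark` and `(1, -1)` to `edge`, even though both responses have the same magnitude along each individual feature.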

To determine how the model generalizes, we must specify how it 
represents distributions of local image regions. A simple way to sum- 
marize the patterns of correlations for a given type of image is the 
covariance matrix of the data. A neural code for this structure could be 
defined by enumerating the set of observed covariances and assigning 
one neuron to each pattern, but this approach presents two problems. 
First, local image classes are not known a priori. Second, given the 
limited number of neurons in the visual system, it is not feasible to 
represent all possible image types, let alone the combinatorial number 
of possible image boundaries. Instead, we propose a distributed code 
in which the graded activity of the neural population acts to describe a 
continuum of potential covariance patterns. 

This distribution coding model is illustrated schematically in Fig. 2. 
The model represents the correlations present in local image regions 
with a multivariate Gaussian distribution that has a fixed mean of 
zero and a covariance that is a function of the neural activity (see 
Figure 2 | Distribution coding model. Rather than encoding the precise 
pixel values of an input image (bottom), the proposed model infers for each 
image the most likely distribution (ellipses) containing it. Activation 
patterns for model neurons are shown at the top of each column. Absence of 
activity corresponds to the lack of image structure (left panel), that is, a 
canonical distribution that reflects the statistics over all natural images 
(black circle). Increased neural activity represents deviations from this 
canonical distribution and captures statistical patterns in local image regions 
(middle and right panels, patches and symbols as in Fig. 1). In each panel, the 
activation pattern is the same for both inputs. See text for further details. 
Methods). This simple statistical description affords both the flex- 
ibility to capture a continuum of natural image distributions and 
mathematical simplicity for tractable parameter estimation. The 
model uses two sets of parameters to describe correlations in image 
distributions. First, the vectors b_k (arrows within circles) specify 
image features along which the encoded distribution can be 
expanded or contracted relative to the canonical distribution (black 
circle). These vectors are shared by all neurons in the model (repre- 
sented by the four grey circles, each of which contains the same set of 
arrows). Because these vectors do not necessarily line up with the axes 
of the input dimensions, changes in variation along a vector can 
correspond to changes in the correlational pattern in many dimen- 
sions at once. Neurons in the model (y_j) describe changes along these 
directions using weights w_jk: each has a different set of weights, cor- 
responding to an expansion or contraction along feature b_k. A pos- 
itive weight (red) means that the neuron responds to a wider range of 
stimuli along that direction, a negative weight (blue) means it 
responds to a narrower range, and a weight close to zero (grey) 
indicates that the neuron is neutral to this direction. The combined 
activation of all neurons specifies the final shape of the encoded 
distribution (ellipses). Given a single fixation—one input image— 
the model computes the neural representation (that is, the image 
distribution) that provides the most probable explanation of the 
input. The model is able to generalize over different image regions 
if the inferred representation is similar across a region (for example, 
for the pairs of patches in Fig. 2). 
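
A minimal sketch of how neural activity could reshape the encoded distribution is given below. It rests on two simplifying assumptions that are not stated in this section: the feature directions b_k are treated as orthonormal, and the log-variance along each direction is taken to be linear in the neural activities, log var_k = Σ_j y_j w_jk. The model's actual parametrization is given in the paper's Methods; the function name `variance_scaling` is hypothetical.

```python
import math

def variance_scaling(y, w):
    """Variance along each feature direction b_k relative to the canonical
    distribution, assuming log-variance is linear in neural activity:
        log(var_k) = sum_j y[j] * w[j][k]

    y: list of neuron activities y_j.
    w: w[j][k] is the weight of neuron j on feature direction k.
    """
    n_features = len(w[0])
    return [math.exp(sum(y_j * w_j[k] for y_j, w_j in zip(y, w)))
            for k in range(n_features)]
```

With a single neuron and weights `w = [[1.0, -1.0, 0.0]]`, zero activity leaves every variance at 1 (the canonical distribution), whereas an activity of 0.5 expands the distribution along the first direction, contracts it along the second and leaves the third unchanged.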

By adapting model parameters b_k and w_jk to the data, we are able to 
find the most efficient way to use a limited number of neurons to 
describe the wide range of distributions observed in natural images. It 
should be noted that, although our goal is to derive a stable repres- 
entation of all patches within a local region, no assumptions about 
locality are made (encoding is done independently for each image 
patch). It is the task of the model to learn a compact representation of 
all patches and to automatically discover which share the statistical 
properties of a particular type. 

If, as hypothesized, neurons in the visual cortex encode patterns in 
correlations in local regions and are adapted specifically to the stat- 
istics of natural scenes, we expect the representations learned by the 
model to reflect properties of visual neurons. To this end, we trained 
the model on patches sampled from a large set of natural images and 
examined the resulting parameters as well as the response properties 
of model neurons to natural images. 

The vectors b_k encode the directions of common expansion or 
contraction in the shape of the image distribution. Drawn as image 
patches, each is an oriented and localized edge-like feature. The full 
set tiles the spatial extent of the image patch (Fig. 3a) and spans the 
range of orientations and spatial frequencies of natural images (not 
shown). These oriented, band-pass image features are consistent with 
the optimal images for exciting simple cells in the primary visual 
cortex. Similar representations have been derived previously 
using linear statistical models that maximize the efficiency of the 
image codes. In the model proposed here, however, these features 
are not used explicitly to reconstruct the original image, but instead 
function to modify the encoded distributions (arrows in Fig. 2). 
Thus, whereas the traditional interpretation of early sensory codes 
is that they are adapted for faithful reconstruction of the stimulus, 
our results suggest an additional interpretation: they convey varia- 
tions in image distributions and allow downstream visual areas to 
form more abstract representations. 

The second set of parameters, the weights w_jk, describes the role of 
each neuron in shaping the encoded image distribution. A set of 
learned weights for a typical model neuron is shown in Fig. 3b. 
This neuron exerts the strongest effect on features in the top left of 
the image patch, increasing the variability (that is, activation) of 
those oriented at its ‘preferred’ orientation of 45°, decreasing the 
variability of those at the orthogonal orientation, as well as those at 
the preferred orientation but at an offset location. Rather than 
responding to a few excitatory or suppressive image features, the 
neuron integrates a large number to describe a pattern of variability 
underlying a particular image distribution. Although the functional 
significance of these subunits is to modify the statistical structure of 
the encoded distribution, they also reflect stimulus features to which 
this model neuron is most sensitive. It should be noted that a model 
neuron is activated by all images from this distribution, rather than 
signalling the presence of one best stimulus. Conversely, stimuli that 
lie in parts of image space assigned low probability by the neuron 
inhibit its activity. 

To compare the behaviour of the model neuron to that of cells in 
the visual cortex, we tested its response to stimuli used in classical 
Figure 3 | Model neurons exhibit properties of cortical visual neurons. 

a, When adapted to natural images, the vectors b_k are oriented, localized in 
space, and span the spatial extent of the 20 × 20-pixel image patch. Each line 
reflects the orientation, spatial position within the image patch, and scale 
(length of line) of one of the image features. Twenty-five representative 
features (from a total of 1,000) are drawn in black, and shown in image form 
on the right. b, Weights of one typical model neuron to the features b_k. As in 
a, each feature is represented by a line, and the colour of the line indicates the 
sign and magnitude of the weight to the feature (see colour bar). Positive 
weights indicate increased variability in the corresponding feature; negative 
weights indicate decreased variability; features to which the neuron is 
insensitive are shaded grey. Image features (b_k) corresponding to the five 
most positive and the five most negative weights are shown in the right panel; 
the corresponding weights are above each feature. These act as excitatory 
and inhibitory subunits for this neuron. c, When presented with sinusoidal 
gratings, this model neuron replicates common aspects of the neural 
response in complex cells in cortical area V1. It is highly tuned to the 
grating’s orientation, but insensitive to its phase. Adding a grating into the 
surrounding region suppresses the response (third plot, 0°) relative to 
baseline response to a single grating (asterisk), but this suppression is tuned 
to the orientation of the surround and is weakest when the surround is 
orthogonal to the preferred orientation (90°). Masking with a superimposed 
orthogonal grating suppresses the response (fourth plot, 90°), but this 
suppression is also orientation-dependent. All model neuron responses are 
plotted on the same scale (red axis); cell firing rates in each plot were 
normalized to a maximum value of 1; preferred orientation was shifted to 0° 
for the model neuron and the cell in all plots. 
physiological experiments (sinusoidal gratings). Model parameters 
were fixed after training on natural images, and neural response 
computed on a set of patterns centred in the visual area that evoked 
maximal response. This particular model neuron showed a variety of 
properties observed in complex cells in V1 and cells in V2, including 
phase invariance, orientation tuning and complex suppressive effects 
(Fig. 3c). A large subset of the population exhibits similar properties, 
whereas others encode more complex patterns that have been 
observed in higher visual areas V2 and V4 (a detailed analysis of 
the population and similarities to other experimental data are pro- 
vided in the Supplementary Information). We emphasize that these 
results, as well as image features described in Fig. 3a, were obtained 
with no assumptions about the image structure encoded by visual 
neurons and without fitting a model to data from physiological 
experiments. Specifically, we did not restrict the encoded image fea- 
tures to be localized and oriented, nor did we prescribe in advance 
how the subunits are to be combined in the pattern represented by 
each neuron. 

Finally, we looked at the way in which the model uses the popu- 
lation of neurons to represent images. If the model is able to general- 
ize across the wide variability present in natural images, then image 
patches that are widely scattered in the original space of linear fea- 
tures should be tightly clustered in the space of the model’s repres- 
entation. This can be illustrated by projecting into two dimensions 
(as was done with image space in Fig. 1) the model representation of a 
collection of images (Fig. 4). As hypothesized, by encoding image 
distributions rather than the precise feature content of each image, 
model neurons are able to encode perceptually similar images with 
similar representations and to separate distinct image types. 

One limitation of the statistical framework used here is that it does 
not furnish an explicit feed-forward algorithm for neural encoding. 
Nevertheless, it is possible to approximate inference in the model by a 
sequential feed-forward computation: a neuron integrates the 
squared responses of a large number of image features b_k and 
correlates the pattern against its weights w_jk (see Supplementary 
Information for details). This computation can be viewed as a 
generalization of the standard model of complex cells, in which each 
complex cell takes as input the squared output of two simple 
cells. In contrast, model neurons can receive many inputs, 
and the linear features themselves are learned. We find that the 
optimal number of input features varies greatly, and the features 
are integrated in a variety of ways. These predictions are consistent 
with recent analyses of functional subfields in V1 complex cells. In 
addition, some model neurons integrate more complex spatial pat- 
terns (see Supplementary Information), which predicts a neural res- 
ponse to a richer variety of images than has been tested 
Figure 4 | Generalization across natural variability. a, In contrast to linear 
projections (compare to Fig. 1b), a two-dimensional projection of the 
model’s representation (the activity of 150 model neurons) reveals 
well-separated clusters. b, Each 3 × 3 image group corresponds to the array of 
symbols in a. Despite the variability in the appearance of edges and textures, 
the model’s representation of natural images generalizes within each region 
while still distinguishing among them. 
physiologically. Experiments that incorporate such stimuli will provide 
an important validation of the proposed model. 
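
The feed-forward approximation described above can be illustrated with a minimal sketch. It assumes the simplest reading of the text, namely that a neuron's response is the weighted sum of squared feature projections, y_j ≈ Σ_k w_jk (b_k · x)^2; the exact expression is given in the Supplementary Information and may include terms omitted here. The function name and the toy values are hypothetical.

```python
def feedforward_response(x, features, weights):
    """Approximate one model neuron's response to a flattened image patch x.

    x        : list of pixel values
    features : list of feature vectors b_k, each the same length as x
    weights  : list of weights w_jk for this neuron, one per feature
    """
    # Linear stage: project the patch onto each feature b_k.
    linear = [sum(b_i * x_i for b_i, x_i in zip(b, x)) for b in features]
    # Nonlinear stage: square each projection (feature "energy").
    energy = [u * u for u in linear]
    # Pooling stage: correlate the energy pattern with the neuron's weights.
    return sum(w * e for w, e in zip(weights, energy))


# Toy example: two orthogonal features in a 2-pixel "patch" (illustrative only).
features = [[1.0, 0.0], [0.0, 1.0]]
weights = [1.0, -1.0]  # excited by feature 0, suppressed by feature 1
r_pos = feedforward_response([2.0, 0.0], features, weights)
r_neg = feedforward_response([-2.0, 0.0], features, weights)  # sign-flipped input
r_orth = feedforward_response([0.0, 2.0], features, weights)
```

Because the projections are squared, the sign-flipped patch gives the same response as the original, which is a crude analogue of phase invariance. The standard complex-cell model corresponds to the special case of two features with equal positive weights.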

The nonlinear effects shown by neurons in the model (Fig. 3c) have 
been previously incorporated into models of complex cells. 
Much of this work has focused on fitting mathematical models to 
neural data and does not provide a functional explanation of 
the observed neural properties. Other models have been motivated by 
specific computational goals, such as statistical independence, 
stability of representation over time, or position or scale invariance. 
However, these models do not explicitly address the problem 
of generalization, which here is performed by inferring the statistical 
distribution that is most likely to explain the input image. An import- 
ant advantage of our approach is that, rather than assuming invari- 
ance (or sensitivity) to limited stimulus parameters such as position 
or orientation, the model learns a much more general set of features 
that are determined by the statistical structures in natural images. If 
higher-level visual neurons are generalizing according to these stat- 
istics, they should have invariance along specific stimulus dimen- 
sions, and their responses to natural images should reflect 
common statistical structure in local image regions. Thus, the model 
provides a quantitative way to explore neural responses to complex 
stimuli characterized by their statistical regularities. 


METHODS SUMMARY 


The model describes individual image patches x with multivariate Gaussian 
probability distributions: 

$$P(\mathbf{x} \mid \mathbf{y}) = \mathcal{N}(\boldsymbol{\mu}, \mathbf{C}) \qquad (1)$$

with mean μ = 0 and with covariance a function of the neural encoding vector, 
C = f(y). The logarithm of the covariance matrix is given by the combination of 
outer products of feature vectors b_k, weighted by neural activities y_j through 
weights w_jk: 

$$\log \mathbf{C} = \sum_{jk} y_j \, w_{jk} \, \mathbf{b}_k \mathbf{b}_k^{\mathsf{T}} \qquad (2)$$

Because a different covariance can be inferred for each image, the distribution 
over the entire ensemble of images is highly non-Gaussian. (This model is a 
generalized version of the hierarchical model described previously, which 
captured patterns among the variances, but not the correlations, of linear features.) 
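
As an illustration of equations (1) and (2), the sketch below builds the log-covariance from neural activities, weights and features, and evaluates the Gaussian log-likelihood of a patch. It relies on two identities that hold when C = exp(L): log det C = trace(L) and C^{-1} = exp(−L), so no explicit matrix inversion is needed. The helper names are hypothetical, and the truncated Taylor series for the matrix exponential is a convenience that is only adequate for small matrices of modest norm; it is not a description of the authors' implementation.

```python
import math


def mat_mul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def mat_exp(a, terms=30):
    """Matrix exponential by truncated Taylor series (small norms only)."""
    n = len(a)
    result = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        # term now holds a^k / k!
        term = [[v / k for v in row] for row in mat_mul(term, a)]
        result = [[r + t for r, t in zip(rr, tr)]
                  for rr, tr in zip(result, term)]
    return result


def log_covariance(y, w, b):
    """log C = sum_jk y_j w_jk b_k b_k^T, as in equation (2)."""
    d = len(b[0])
    log_c = [[0.0] * d for _ in range(d)]
    for j, y_j in enumerate(y):
        for k, b_k in enumerate(b):
            coeff = y_j * w[j][k]
            for r in range(d):
                for c in range(d):
                    log_c[r][c] += coeff * b_k[r] * b_k[c]
    return log_c


def log_likelihood(x, y, w, b):
    """log P(x|y) for a zero-mean Gaussian with covariance exp(log C)."""
    log_c = log_covariance(y, w, b)
    d = len(x)
    log_det = sum(log_c[i][i] for i in range(d))        # log det C = trace(log C)
    inv_c = mat_exp([[-v for v in row] for row in log_c])  # C^{-1} = exp(-log C)
    quad = sum(x[i] * inv_c[i][j] * x[j]
               for i in range(d) for j in range(d))
    return -0.5 * (d * math.log(2 * math.pi) + log_det + quad)
```

With all activities at zero the covariance reduces to the identity, which is the canonical (whitened) distribution; non-zero activities stretch or compress the distribution along the feature directions.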
We trained the model on a large set of 20 × 20 image patches, sampled 
randomly from greyscale photographs of outdoor scenes. The number of neurons 
was set to 150 and the number of linear features b_k to 1,000. The ‘response’ of 
model neurons was computed as the most probable neural representation given 
the input image by maximizing the posterior probability P(y|x, {b_k, w_jk}). Model 
parameters were initialized to small random values and optimized by 
maximizing the likelihood of the data under the model P(x|{b_k, w_jk}) using standard 
gradient ascent. 
For the ‘physiological’ analysis of Fig. 3c, we first identified the location, 
orientation, and spatial extent and frequency of a windowed sinusoidal grating 
that best activated the model neuron (one that yielded the most positive value of 
y_j). We then varied each tested parameter and computed the model’s 
representation of the stimulus (the vector of responses of model neurons). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 

Data. We used 110 greyscale images of outdoor scenes as training data. Pixel 
intensities were log-transformed (corresponding roughly to the transformation 
at the retinal cone cells), and the images were low-pass filtered to remove corner 
frequency sampling artefacts. We randomly extracted overlapping 20 × 20 
image patches from the entire data set. The mean luminance value was 
subtracted from each patch (which sped up model training but had no significant 
influence on the results). We ‘whitened’ all image patches to remove global 
correlations and to normalize the variance; this allowed the model to encode 
only the deviations of each image distribution from the global statistics (the 
canonical distribution). For visualization of image features, the results were 
projected back into the original image space. All stimuli in the physiological 
analysis of Fig. 3c were preprocessed in the same way as the natural images used 
in training. 

Model parameter estimation. We estimated the optimal model parameters 
θ = {b_k, w_jk} by maximizing the likelihood of the data under the model 

$$P(\mathbf{x} \mid \theta) = \int P(\mathbf{x} \mid \mathbf{y}, \theta)\, P(\mathbf{y})\, d\mathbf{y} \qquad (3)$$

The conditional distribution P(x|y, θ) is a multivariate Gaussian that captures 
correlations in local image regions (equation (1)). Neural activities were 
assumed to be sparse and independent, and were modelled with a Laplacian 
(symmetric exponential) distribution, $P(\mathbf{y}) = \prod_j P(y_j) \propto \prod_j e^{-|y_j|}$. The integral 
over all possible neural states in equation (3) is intractable and was replaced 
by a single evaluation at the maximum a posteriori value ŷ = arg max_y P(y|x, θ). 
Although this approximation ignores the volume around the maximum, it is one 
standard approach to tackling this problem. 
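
The maximum a posteriori step can be sketched under a strong simplifying assumption that does not hold in the paper: if the features b_k formed a complete orthonormal basis, the covariance would be diagonal in that basis, with log-variance a_k = Σ_j y_j w_jk along feature k, and the log-likelihood would reduce to −½ Σ_k (a_k + u_k² e^{−a_k}) up to a constant, where u_k = b_k · x. (The model described here uses 1,000 features for 400-pixel patches, so this is only a didactic special case.) The optimizer below is a proximal gradient method with soft-thresholding for the unit-scale Laplacian prior; this is a technique chosen for the sketch, not one stated in the text, and all names are hypothetical.

```python
import math


def soft_threshold(v, t):
    """Proximal operator of t*|v|: shrink v toward zero by t."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def map_estimate(u, w, steps=2000, lr=0.01):
    """MAP neural activities for feature projections u and weights w[j][k].

    Assumes orthonormal features and a unit-scale Laplacian prior on each y_j.
    """
    n_neurons, n_features = len(w), len(u)
    y = [0.0] * n_neurons
    for _ in range(steps):
        # a_k = sum_j y_j w_jk is the log-variance along feature k.
        a = [sum(y[j] * w[j][k] for j in range(n_neurons))
             for k in range(n_features)]
        # Gradient of the Gaussian log-likelihood with respect to each y_j.
        grad = [sum(-0.5 * w[j][k] * (1.0 - u[k] ** 2 * math.exp(-a[k]))
                    for k in range(n_features))
                for j in range(n_neurons)]
        # Gradient step on the likelihood, then shrinkage for the prior.
        y = [soft_threshold(y[j] + lr * grad[j], lr)
             for j in range(n_neurons)]
    return y


# One neuron, one feature: a large projection drives the neuron above zero,
# whereas a projection of typical (unit) size leaves it silent.
y_large = map_estimate([3.0], [[1.0]])
y_typical = map_estimate([1.0], [[1.0]])
```

In this one-dimensional case the optimum can be checked by hand: for u = 3 the stationarity condition 4.5 e^{−y} = 1.5 gives y = ln 3, and for u = 1 the prior holds the activity at exactly zero, which illustrates how the Laplacian prior produces sparse responses.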

We assumed the training patches were sampled independently and that the 
likelihood for the data ensemble was a product of terms for individual images 
(equation (3)). In practice, we maximized the log-likelihood using gradient 
ascent on batches of 100 image patches. Repeated training runs produced 
convergence to similar parameter values. 
Model responses to grating stimuli. The orientation tuning of model neurons 
in Fig. 3c was measured using 20 × 20 patches of sinusoidal gratings at different 
positions, orientations, spatial frequencies and phases. We first eliminated 
neurons that were ‘unresponsive’ to gratings, that is, those whose maximal response 
did not reach 2 standard deviations of the population response to gratings. This 
was necessary to discount small random activation of neurons tuned for other 
types of image structures. For each neuron we found the grating with the max- 
imal response and measured modulation in response to varying orientation, 
phase, or the addition of masks in the receptive field or the surround. Because 
neural activity could be positive or negative, the full amplitude of modulation 
was considered as twice the maximum absolute value of the response. 

A neuron was considered to be orientation-tuned if its response was modu- 
lated by more than 50% over the range of stimulus orientations, and to be phase 
invariant if the response varied less than 50% over phase-shifted gratings. Cross- 
orientation inhibition and surround suppression corresponded to greater than 
25% decrease in neural response. Bandwidth of orientation tuning was 
computed as the width at 1/√2 of the full amplitude of the response modulation. 
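
The classification criteria above can be written as a short sketch. It assumes that ‘modulation’ means the range of the response (maximum minus minimum) expressed as a fraction of the full amplitude, taken as twice the absolute value of the neuron's peak response, and that suppression is measured relative to a positive baseline. These readings, the function names and the example values are assumptions made for illustration.

```python
def modulation(responses, peak):
    """Range of the responses as a fraction of the full amplitude 2*|peak|."""
    return (max(responses) - min(responses)) / (2.0 * abs(peak))


def classify_neuron(orientation_curve, phase_curve):
    """Apply the 50% criteria for orientation tuning and phase invariance."""
    # Peak response: largest absolute response across both tests.
    peak = max(orientation_curve + phase_curve, key=abs)
    return {
        "orientation_tuned": modulation(orientation_curve, peak) > 0.5,
        "phase_invariant": modulation(phase_curve, peak) < 0.5,
    }


def is_suppressed(baseline, masked):
    """True if the mask or surround reduces the response by more than 25%."""
    return (baseline - masked) / baseline > 0.25


# Hypothetical responses of one model neuron (not data from the paper).
orientation_curve = [1.0, 0.2, -0.8, 0.2]   # varies strongly with orientation
phase_curve = [1.0, 0.9, 0.95, 0.9]         # nearly constant across phase
labels = classify_neuron(orientation_curve, phase_curve)
```

Dividing by twice the peak magnitude reflects the statement that neural activity can be positive or negative, so a response swinging from +peak to −peak counts as 100% modulation.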

The projection of neural activity in Fig. 4 was computed using linear discrim- 
inant analysis, a technique that finds the linear projections that best separate 
different classes of data. Applied to the raw pixel data or to the outputs of linear 
features (data shown in Fig. 1), this method failed to separate the clusters. 
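
For readers unfamiliar with linear discriminant analysis, the following minimal sketch shows the two-class, two-dimensional case (Fisher's discriminant), in which the projection direction is w = S_w^{-1}(m_a − m_b), with S_w the pooled within-class scatter and m_a, m_b the class means. The analysis in Fig. 4 involves several image classes and a two-dimensional projection, so this is a reduced illustration rather than the procedure actually used; the data are invented.

```python
def mean(points):
    """Component-wise mean of a list of 2D points."""
    n = len(points)
    return [sum(p[i] for p in points) / n for i in range(2)]


def scatter(points, m):
    """2x2 within-class scatter matrix around the class mean m."""
    s = [[0.0, 0.0], [0.0, 0.0]]
    for p in points:
        d = [p[0] - m[0], p[1] - m[1]]
        for i in range(2):
            for j in range(2):
                s[i][j] += d[i] * d[j]
    return s


def fisher_direction(class_a, class_b):
    """Direction that best separates two classes relative to their spread."""
    ma, mb = mean(class_a), mean(class_b)
    sa, sb = scatter(class_a, ma), scatter(class_b, mb)
    sw = [[sa[i][j] + sb[i][j] for j in range(2)] for i in range(2)]
    det = sw[0][0] * sw[1][1] - sw[0][1] * sw[1][0]
    diff = [ma[0] - mb[0], ma[1] - mb[1]]
    # Solve Sw * w = diff using the explicit 2x2 inverse.
    return [(sw[1][1] * diff[0] - sw[0][1] * diff[1]) / det,
            (-sw[1][0] * diff[0] + sw[0][0] * diff[1]) / det]


def project(points, w):
    """Project each 2D point onto direction w."""
    return [p[0] * w[0] + p[1] * w[1] for p in points]


# Two classes spread widely along x but offset slightly in y (invented data).
class_a = [[0.0, 0.0], [2.0, 0.1], [4.0, -0.1]]
class_b = [[0.0, 1.0], [2.0, 1.1], [4.0, 0.9]]
w = fisher_direction(class_a, class_b)
proj_a, proj_b = project(class_a, w), project(class_b, w)
```

In this example the direction of greatest variance (x) carries no class information, whereas the discriminant direction is dominated by y and separates the two groups completely.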
Immune control of an SIV challenge by a T-cell-based vaccine in rhesus monkeys 
Peter Abbink, Rory T. Coffey, Lauren E. Grandpre, Michael S. Seaman, Gary Landucci, Donald N. Forthal, 
A recombinant adenovirus serotype 5 (rAd5) vector-based vaccine 
for HIV-1 has recently failed in a phase 2b efficacy study in 
humans. Consistent with these results, preclinical studies have 
demonstrated that rAd5 vectors expressing simian immunodefi- 
ciency virus (SIV) Gag failed to reduce peak or setpoint viral loads 
after SIV challenge of rhesus monkeys (Macaca mulatta) that 
lacked the protective MHC class I allele Mamu-A*01 (ref. 3). 
Here we show that an improved T-cell-based vaccine regimen 
using two serologically distinct adenovirus vectors afforded sub- 
stantially improved protective efficacy in this challenge model. 
In particular, a heterologous rAd26 prime/rAd5 boost vaccine 
regimen expressing SIV Gag elicited cellular immune responses 
with augmented magnitude, breadth and polyfunctionality as 
compared with the homologous rAd5 regimen. After SIVmac251 
challenge, monkeys vaccinated with the rAd26/rAd5 regimen 
showed a 1.4 log reduction of peak and a 2.4 log reduction of 
setpoint viral loads as well as decreased AIDS-related mortality 
as compared with control animals. These data demonstrate that 
durable partial immune control of a pathogenic SIV challenge for 
more than 500 days can be achieved by a T-cell-based vaccine in 
Mamu-A*01-negative rhesus monkeys in the absence of a homo- 
logous Env antigen. These findings have important implications 
for the development of next-generation T-cell-based vaccine can- 
didates for HIV-1. 

Recombinant Ad5 vector-based vaccines expressing SIV Gag have 
been shown to afford dramatic control of viral replication after 
simian-human immunodeficiency virus (SHIV)-89.6P challenge of 
rhesus monkeys. However, rAd5-Gag vaccines have failed to reduce 
peak or setpoint viral loads after SIVmac239 challenge of rhesus 
monkeys, highlighting important differences in the stringencies of these 
challenge models. Heterologous DNA prime/rAd5 boost vaccine 
regimens have also failed to date to reduce setpoint viral loads after 
SIVmac challenge of rhesus monkeys that lacked the protective MHC 
class I allele Mamu-A*01 (refs 3 and 6). The inability of vector-based 
vaccines to afford durable control of setpoint viral loads after SIVmac 
challenge of Mamu-A*01-negative rhesus monkeys has led to sub- 
stantial debate about the viability of the concept of developing T-cell- 
based vaccines for HIV-1. 

Pre-existing Ad5-specific neutralizing antibodies have been 
reported to reduce the immunogenicity of rAd5 vector-based 
vaccines in clinical trials and may also compromise their safety. Rare 
serotype rAd vectors, such as rAd26 and rAd35 vectors, have 
been developed as potential alternatives. Serologically distinct rAd 
vectors also allow the development of more potent heterologous rAd 
Figure 1 | Immunogenicity of heterologous rAd prime-boost vaccine 
regimens. Rhesus monkeys were primed at week 0 and boosted at week 24 
with rAd26/rAd5, rAd35/rAd5 or rAd5/rAd5 regimens expressing SIV Gag. 
a, Gag-specific IFN-γ ELISPOT assays were performed at weeks 0, 2, 24, 26 
and 52 after immune priming. b, CD4+ (white bars) and CD8+ (black bars) 
T lymphocyte responses were evaluated at week 28 by CD8-depleted and 
CD4-depleted ELISPOT assays, respectively. SFC, spot-forming cells. c, The 
breadth of responses was determined by Gag epitope mapping at week 28. 
d, Gag-specific antibody responses were determined by ELISA at week 28. 
Mean responses with standard errors are shown (a-d). e, Functionality of 
Gag-specific CD8+ and CD4+ central memory (CM; CD28+CD95+) and 
effector memory (EM; CD28−CD95+) T lymphocyte responses was 
assessed by 8-colour ICS assays. Proportions of IFN-γ, TNF-α and IL-2 
responses are depicted individually and in all possible combinations for each 
cellular subpopulation. CD4+ EM responses after rAd5/rAd5 immunization 
were below the detection limit of the assay. 
prime-boost vaccine regimens. To investigate the immunogenicity 
and protective efficacy of such regimens, we immunized 22 Indian-origin 
rhesus monkeys (Macaca mulatta) that lacked the protective 
MHC class I alleles Mamu-A*01 (refs 13-15) and Mamu-B*17 (ref. 
16) with the following heterologous or homologous rAd prime-boost 
regimens: (1) rAd26-Gag prime/rAd5-Gag boost (n = 6); (2) rAd35-Gag 
prime/rAd5-Gag boost (n = 6); (3) rAd5-Gag prime/rAd5-Gag 
boost (n = 4); and (4) sham controls (n = 6). One monkey each in 
groups 1, 3 and 4 expressed the protective Mamu-B*08 allele. 
Monkeys were primed at week 0 and boosted at week 24 with 10^11 
viral particles of each vector expressing SIVmac239 Gag. At week 52, 
all animals received a high-dose intravenous (i.v.) challenge with 100 
infectious doses of SIVmac251 (ref. 6). 

We monitored vaccine-elicited SIV Gag-specific cellular 
(Fig. 1a-c) and humoral (Fig. 1d) immune responses in these animals 
before challenge. After the priming immunization, IFN-γ 
enzyme-linked immunospot (ELISPOT) responses to pooled SIV Gag 
peptides were observed in all vaccinees. Monkeys primed with rAd26-Gag 
or rAd35-Gag were efficiently boosted by the heterologous rAd5-Gag 
vector to peak responses of 2,513 and 1,163 spot-forming cells 
per 10^6 peripheral blood mononuclear cells, respectively, 2 weeks 
after the boost immunization (Fig. 1a; green bars). In contrast, 
monkeys primed with rAd5-Gag were only marginally boosted by a 
second injection of rAd5-Gag as a result of anti-vector immunity 
generated by the priming immunization. Cell-depleted 
ELISPOT assays demonstrated that these responses were primarily 
CD8+ T lymphocyte responses, although lower levels of CD4+ T 
lymphocyte responses were also clearly observed (Fig. 1b). Epitope 
mapping was then performed by assessing ELISPOT responses 
against all 125 individual 15-amino-acid SIV Gag peptides after the 
boost immunization. The rAd26/rAd5 regimen elicited a mean of 8.6 
detectable Gag epitopes per animal, whereas the rAd35/rAd5 regimen 
elicited a mean of 4.5 epitopes per animal and the rAd5/rAd5 regimen 
induced a mean of only 2.2 epitopes per animal (Fig. 1c). These data 
demonstrate that the heterologous rAd26/rAd5 regimen induced an 
Figure 2 | Protective efficacy of heterologous rAd prime-boost vaccine 
regimens. Monkeys were challenged intravenously with SIVmac251, and 
protective efficacy was monitored by SIV RNA levels (a, b), CD4+ T 
lymphocyte counts (c), and clinical disease progression and mortality 
(d) after challenge. Viral loads are depicted longitudinally for each group 

8.7-fold greater magnitude and a 3.9-fold increased breadth of 
Gag-specific cellular immune responses as compared with the 
homologous rAd5/rAd5 regimen. 

We next assessed the functionality of the vaccine-elicited, 
Gag-specific T lymphocyte responses by multiparameter flow 
cytometry. Intracellular cytokine staining (ICS) assays were 
performed to assess IFN-γ, TNF-α and IL-2 secretion in CD8+ and 
CD4+ central memory (CD28+CD95+) and effector memory 
(CD28−CD95+) T lymphocyte subpopulations. Consistent with 
our previous findings, we observed larger proportions of 
polyfunctional IFN-γ/TNF-α/IL-2-positive (black) and IL-2-positive (yellow) 
CD8+ and CD4+ T lymphocyte responses in the animals that 
received the heterologous rAd26/rAd5 regimen as compared with 
the homologous rAd5/rAd5 regimen (Fig. 1e). In contrast, the 
rAd5/rAd5 regimen elicited primarily IFN-γ-positive (red) and 
IFN-γ/TNF-α-positive (blue) responses. 

Six months after the boost immunization, all animals were 
challenged intravenously with SIVmac251. Protective efficacy was 
evaluated by monitoring plasma SIV RNA levels (Fig. 2a, b), CD4+ T 
lymphocyte counts (Fig. 2c), and clinical disease progression and 
mortality (Fig. 2d) after challenge. Monkeys that received the 
rAd26/rAd5 regimen showed mean peak viral loads that were 
1.43 logs lower than the mean peak viral loads in the control animals 
(Fig. 2a, b; P = 0.002, two-sided Wilcoxon’s rank-sum test with 
multiple comparison adjustments). More importantly, the 
rAd26/rAd5-vaccinated animals also demonstrated durable partial control of 
setpoint viral loads, defined as the mean SIV RNA levels from day 112 
to day 420 after challenge. In particular, the rAd26/rAd5-vaccinated 
monkeys maintained a 2.44 log reduction of setpoint viral loads as 
compared with the control animals (Fig. 2a, b; P = 0.01). If the three 
Mamu-B*08-positive animals are excluded from the analysis, then 
the rAd26/rAd5 regimen afforded a 2.66 log reduction of setpoint 
viral loads as compared with the control animals (P = 0.008; data 
not shown). As expected, the homologous rAd5/rAd5 regimen 
afforded no detectable control of setpoint viral loads, consistent with 


[Figure 2, panels c and d. c, Four plots (Ad26/Ad5, Ad35/Ad5, Ad5/Ad5, Sham) with a vertical axis from 0 to 2,500 against days after challenge (0-400). d, Plot with a vertical axis from 20 to 100 against days after challenge (0-500) for the Ad26/Ad5, Ad35/Ad5, Ad5/Ad5 and Sham groups; P = 0.03.] 
the previously reported failure of this regimen in this stringent SIV 
challenge model. 
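
To put the reported log reductions on a linear scale, the sketch below converts a log10 difference into a fold change and shows one way a setpoint could be computed from a time course. Averaging log10-transformed RNA values over days 112 to 420 is an assumption made for this sketch (the text specifies only the mean SIV RNA level over that window), and the time courses are invented for illustration; they are not data from the study.

```python
def fold_change(log10_reduction):
    """Convert a log10 reduction in viral load into a fold reduction."""
    return 10.0 ** log10_reduction


def setpoint_log10(samples, first_day=112, last_day=420):
    """Mean log10 viral load over the setpoint window.

    samples: list of (day, log10_rna) pairs. Averaging on the log scale
    is an assumption of this sketch.
    """
    window = [v for day, v in samples if first_day <= day <= last_day]
    return sum(window) / len(window)


# Reductions quoted in the text, expressed as fold changes.
peak_fold = fold_change(1.43)       # roughly 27-fold
setpoint_fold = fold_change(2.44)   # roughly 275-fold

# Hypothetical log10 RNA time courses (not data from the study).
control = [(56, 6.5), (112, 6.0), (224, 6.2), (420, 6.1)]
vaccinee = [(56, 4.5), (112, 3.8), (224, 3.6), (420, 3.7)]
reduction = setpoint_log10(control) - setpoint_log10(vaccinee)
```

A 1.43 log reduction therefore corresponds to peak viral loads about 27 times lower than in controls, and a 2.44 log reduction to setpoint viral loads about 275 times lower.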

Monkeys that received the rAd26/rAd5 regimen also demonstrated 
slower declines of CD4+ T lymphocyte counts as compared with the 
other groups (Fig. 2c). Moreover, the rAd26/rAd5 vaccine afforded a 
significant reduction of AIDS-related mortality as compared with the 
controls at day 500 after challenge (Fig. 2d; P = 0.03, Fisher’s exact 
test), whereas the rAd35/rAd5 and rAd5/rAd5 vaccines provided a 
trend towards a survival advantage. Necropsies showed that the 
causes of death were AIDS-defining illnesses, including 
Pneumocystis pneumonia, Cryptosporidium enteritis, cytomegalo- 
virus pneumonia, SIV encephalitis and lymphoma. 

We next evaluated the evolution of SIV-specific cellular (Fig. 3a, c, 
e) and humoral (Fig. 3b, d) immune responses in these animals after 
challenge. The rAd26/rAd5 vaccinees exhibited tenfold greater ana- 
mnestic Gag-specific ELISPOT responses as compared with the con- 
trols during both acute and chronic infection (Fig. 3a; black bars), 
suggesting the critical importance of these responses for immune 
control. In contrast, Pol-, Nef- and Env-specific cellular immune 
responses were either comparable or lower in magnitude in the 
rAd26/rAd5 vaccinees as compared with the other groups (Fig. 3a). 
The rAd26/rAd5 vaccinees thus showed less extensive diversification 
of cellular immune responses against these other SIV antigens, 


[Figure 3, panels a and b. Rows of plots for days 0, 14, 28, 56 and 112 after challenge, each showing the Ad26/Ad5, Ad35/Ad5, Ad5/Ad5 and Sham groups. Legend for b: Gag ELISA, Env ELISA, SIV NAb (TCLA), SIV NAb (primary).] 


Figure 3 | Cellular and humoral immune responses after challenge. 
Cellular (a, c, e) and humoral (b, d) immune responses were assessed after 
challenge. IFN-γ ELISPOT assays (a) and ELISA and neutralizing antibody 
assays (b) were performed on days 0, 14, 28, 56 and 112 after challenge. NAb, 
neutralizing antibody; primary, primary isolate virus; TCLA, T-cell 
laboratory-adapted virus. Eight-colour ICS assays on day 28 (c), ADCVI 
assays at 1:100 serum dilution at the time points indicated (d), and Gag 
epitope mapping studies on day 28 (e) were also performed. Mean responses 
presumably reflecting the reduced viral replication in these animals. 
Phenotypic analysis of the Gag-specific CD8+ T lymphocyte 
responses in the rAd26/rAd5 vaccinees demonstrated slightly higher 
expression of IFN-γ and TNF-α but 6.7-fold greater expression of 
IL-2 as compared with the rAd5/rAd5 vaccinees after challenge 
(Fig. 3c; green bars). These data suggest that not only quantitative 
but also qualitative differences among the vaccine regimens may have 
contributed to protective efficacy. 

We assessed the breadth of Gag-specific cellular immune responses 
on day 28 after challenge by epitope mapping using 15-amino-acid 
SIV Gag peptides (Fig. 3e). The monkeys that received the 
rAd26/rAd5 regimen developed a mean of 16.1 (range 4-29) Gag 
epitope-specific responses after challenge. In contrast, the control animals 
developed <1 detectable Gag epitope-specific response after 
challenge. Notably, both the breadth and the magnitude of Gag-specific 
cellular immune responses elicited by vaccination (P = 0.0049 and 
P = 0.0025, respectively, Spearman’s rank-correlation tests) and 
recalled after challenge (P = 0.018 and P = 0.024, respectively) 
correlated with control of setpoint viral loads in these animals (Fig. 3f). 

Humoral immune responses were evaluated by Gag- and Env- 
specific ELISAs as well as by luciferase-based pseudovirus 
neutralizing antibody assays using both laboratory-adapted and primary 
isolate SIVmac251 viruses (Fig. 3b). The vaccinees developed more 
with standard errors are shown (a-e). f, Correlations between the breadth 
(left panels) or magnitude (right panels) of pre-challenge (top panels) or 
post-challenge (bottom panels) Gag-specific cellular immune responses and 
setpoint viral loads were evaluated by two-sided Spearman’s rank correlation 
tests. Monkeys immunized with rAd26/rAd5 (triangles), rAd35/rAd5 
(diamonds), rAd5/rAd5 (squares) and sham (circles) vaccine regimens are 
depicted. 
rapid kinetics of Gag-specific ELISA responses as compared with the 
controls after challenge, but no clear differences of Env-specific 
ELISA or neutralizing antibody responses were observed among 
groups. Compared with controls, the vaccinees also developed more 
rapid kinetics of antibody-dependent cell-mediated virus inhibition 
(ADCVI) on day 14 after challenge (Fig. 3d; P = 0.01), suggesting 
the potential functional relevance of nonclassical antibody responses 
in early SIV infection. 

We next monitored CD4+ T lymphocyte dynamics in these animals 
after challenge. The monkeys that received the rAd26/rAd5 regimen 
had a relative preservation of central memory CD4+ T lymphocytes 
(Fig. 4a; black bars; P = 0.02, Wilcoxon’s rank-sum test) as well as a 
more marked preservation of CCR5+ central memory CD4+ T 
lymphocytes (Fig. 4b; black bars; P = 0.002) as compared with the 
controls on day 14 after challenge. The rAd26/rAd5 vaccinees also 
showed reduced Ki67+ proliferation of CCR5+ central memory 
(Fig. 4c, d; P = 0.004) and effector memory (Fig. 4e, f; P = 0.004) 
CD4+ T lymphocytes as compared with the controls after challenge. 
In addition, on day 21 after challenge, monkeys that received the 
rAd26/rAd5 regimen maintained markedly higher levels of 
gastrointestinal CD4+ T lymphocytes in duodenal biopsies as compared with 
the controls, which showed extensive depletion of this lymphocyte 
Figure 4 | CD4+ T lymphocyte dynamics after challenge. Preservation of 
central memory (CM; CD28+CD95+) CD4+ T lymphocytes (a) and CCR5+ 
central memory CD4+ T lymphocytes (b) was assessed on days 0, 14, 28, 56 
and 112 after challenge. Ki67 staining of central memory CD4+ (c), CCR5+ 
central memory CD4+ (d), effector memory (EM; CD28−CD95+) CD4+ 
(e), and CCR5+ effector memory CD4+ (f) T lymphocytes was also 
determined. Mean responses with standard errors are shown 
(a-f). g, Preservation of gastrointestinal CD4+ T lymphocytes was assessed 
in duodenal biopsies on day 21 after challenge. Comparisons were 
performed by two-sided Wilcoxon’s rank-sum tests. 
population as expected (Fig. 4g; P = 0.02, Wilcoxon’s rank-sum 
test). 

Our data demonstrate that the heterologous rAd26/rAd5 regimen 
elicited improved magnitude, breadth and functionality of 
Gag-specific cellular immune responses as compared with the homologous 
rAd5/rAd5 regimen and afforded durable partial immune control of 
a homologous SIVmac251 challenge in rhesus monkeys. To the best of 
our knowledge, vector-based vaccines have not previously been 
reported to reduce setpoint viral loads in the stringent system of 
SIVmac challenge of Mamu-A*01-negative rhesus monkeys, 
although a previous study has shown partial control of setpoint viral 
loads using a DNA/rAd5 regimen in the less stringent system of 
Mamu-A*01-positive rhesus monkeys. In the present study, the 
breadth and magnitude of vaccine-elicited, Gag-specific cellular 
immune responses before challenge correlated with control of set- 
point viral loads after challenge (Fig. 3f), suggesting the critical 
importance of Gag-specific T lymphocyte responses in controlling 
viral replication. Protective efficacy was also associated with 
improved functionality of Gag-specific T lymphocyte responses 
(Figs 1e and 3c), reduced diversification of responses against other
SIV antigens after challenge (Fig. 3a), and preservation of CCR5+
central memory CD4+ T lymphocytes and gastrointestinal CD4+ T
lymphocytes after challenge (Fig. 4b, g), although some of these
parameters may reflect the result rather than the cause of the reduced 
viral replication. 

It is important to highlight the fact that the vaccines used in the 
present study expressed only a single SIV Gag antigen and did not 
include a homologous Env immunogen. The observed protective 
efficacy was therefore presumably mediated by vaccine-elicited 
Gag-specific cellular immune responses, because it is unlikely that 
Gag-specific antibodies afforded substantial protection. Consistent 
with these data, a previous study reported that the breadth of Gag- 
specific cellular immune responses correlated with control of viral 
loads in chronically HIV-1-infected humans. It remains possible,
however, that other SIV antigens may also contribute to cellular 
immune protection if included in a multivalent vaccine. Future stud- 
ies should address this possibility and evaluate the protective efficacy 
of optimal vaccine regimens against highly heterologous SIV chal- 
lenges. The potential role of early ADCVI responses should also be 
explored, because ADCVI activity emerged more rapidly in the vac- 
cinees as compared with the controls after challenge (Fig. 3d). These 
responses probably reflect immune preservation in the vaccinees 
rather than a direct result of vaccination, because ADCVI activity 
has been reported to be mediated by Env-specific antibodies.

Despite current controversies about the use of rAd vector-based 
vaccines for HIV-1, our data have important implications for the 
development of next generation T-cell-based vaccines by the proof- 
of-concept demonstration that durable partial immune control of a 
pathogenic SIV challenge can be achieved in Mamu-A*01-negative 
rhesus monkeys. It remains unclear, however, whether the observed 
protection reflected the use of rAd26 vectors, the heterologous rAd 
prime-boost regimen, or both. Further studies will therefore be 
required to evaluate these possibilities and to determine the use of 
regimens consisting of two rare serotype rAd vectors. In conclusion, 
our findings suggest that T-cell-based vaccine regimens that elicit 
augmented magnitude, breadth and polyfunctionality of cellular 
immune responses as compared with the homologous rAd5 regimen 
may afford superior protective efficacy against HIV-1 and other 
pathogens. 


METHODS SUMMARY 

Animals, immunizations and challenge. Outbred rhesus monkeys that did not
express the MHC class I alleles Mamu-A*01 and Mamu-B*17 were housed at
New England Primate Research Center (NEPRC), Southborough,
Massachusetts, USA. Immunizations involved 10^11 viral particles of
replication-incompetent, E1/E3-deleted rAd5, rAd35 or rAd26 vectors expressing
SIVmac239 Gag and were delivered as 1-ml injections intramuscularly in both
quadriceps muscles at weeks 0 and 24. Sham controls received 10^11 viral particles
of empty vectors. At week 52, all animals received an i.v. inoculation of 100
infectious doses of SIVmac251 as described. SIV RNA levels were assessed after
challenge on days 0, 3, 7, 10, 14 and 21, then weekly until day 42, biweekly until
day 84, and monthly thereafter (Siemens Diagnostics). All animal studies were
approved by our Institutional Animal Care and Use Committees (IACUC).
Cellular immune assays. SIV-specific cellular immune responses were assessed
by IFN-γ ELISPOT assays and multiparameter ICS assays essentially as
described. ELISPOT assays used pooled or individual SIV Gag peptides.
Eight-colour ICS assays used pooled SIV Gag peptides and the following
monoclonal antibodies (BD Pharmingen): anti-CD3-Alexa700 (SP34),
anti-CD4-AmCyan (L200), anti-CD8-APC-Cy7 (SK1), anti-CD28-PerCP-Cy5.5 (L293),
anti-CD95-PE (DX2), anti-IFN-γ-PE-Cy7 (B27), anti-IL-2-APC (MQ1-17H12)
and anti-TNF-α-FITC (Mab11). Assessment of T lymphocyte dynamics
used the following monoclonal antibodies: anti-CD3-Alexa700 (SP34),
anti-CD4-AmCyan (L200), anti-CD8-APC-Cy7 (SK1), anti-CD28-PerCP-Cy5.5
(L293), anti-CD95-APC (DX2), anti-CCR5-PE (3A9), anti-HLA-DR-PE-Cy7
(L243) and anti-Ki67-FITC (B56). Gastrointestinal CD4+ T lymphocytes were
evaluated after collagenase digestion of duodenal biopsies and percoll gradient
purification.

Humoral immune assays. SIV-specific humoral immune responses were
evaluated by SIV Gag- and SIV Env gp130-specific ELISAs, luciferase-based
pseudovirus neutralization assays (using T-cell laboratory adapted and primary
isolate viruses), and ADCVI assays essentially as described.

Live-animal tracking of individual haematopoietic
stem/progenitor cells in their niche


Cristina Lo Celso, Heather E. Fleming, Juwell W. Wu, Cher X. Zhao, Sam Miake-Lye, Joji Fujisaki,
Daniel Côté, David W. Rowe, Charles P. Lin & David T. Scadden


Stem cells reside in a specialized, regulatory environment termed 
the niche that dictates how they generate, maintain and repair 
tissues. We have previously documented that transplanted
haematopoietic stem and progenitor cell populations localize to 
subdomains of bone-marrow microvessels where the chemokine 
CXCL12 is particularly abundant. Using a combination of
high-resolution confocal microscopy and two-photon video imaging of
individual haematopoietic cells in the calvarium bone marrow of 
living mice over time, we examine the relationship of haemato- 
poietic stem and progenitor cells to blood vessels, osteoblasts and 
endosteal surface as they home and engraft in irradiated and c-Kit- 
receptor-deficient recipient mice. Osteoblasts were enmeshed in 
microvessels and relative positioning of stem/progenitor cells 
within this complex tissue was nonrandom and dynamic. Both cell 
autonomous and non-autonomous factors influenced primitive 
cell localization. Different haematopoietic cell subsets localized 
to distinct locations according to the stage of differentiation. 
When physiological challenges drove either engraftment or expan- 
sion, bone-marrow stem/progenitor cells assumed positions in 
close proximity to bone and osteoblasts. Our analysis permits 
observing in real time, at a single cell level, processes that pre- 
viously have been studied only by their long-term outcome at 
the organismal level. 

Mammalian stem cell niches have largely been defined based on 
localization of stem cells, with only limited definition of the hetero- 
logous cells within the niche that regulate stem cell function. None of 
these niches has been examined in a physiological context in vivo. The 
haematopoietic stem cell (HSC) niche has been studied by alteration 
of bone components that induce a change in stem cell function or
by immunohistochemistry. The former does not precisely define
localization and the latter does not define function. Several studies
have documented the regulatory role of osteoblasts on HSC
fate; however, other studies have shown that most
immunophenotypically defined HSCs can be observed adjacent to the
vasculature in bone marrow.

Bone-marrow vasculature includes arteries that penetrate compact 
bone at regular intervals, arborizing into capillaries that converge 
into a central sinus. CD31 immunostaining of long bones enabled
visualization of this vasculature network adjacent to the endosteal
surface, particularly in trabecular regions where HSCs are known to
localize (Fig. 1a and data not shown).

To gain a detailed, dynamic view of haematopoietic stem and
progenitor cell (HSPC) localization we used intravital microscopy,
scanning 4 mm × 6 mm of mouse calvarium that included the central
sinus and the surrounding bone-marrow cavities (Fig. 1b). HSC
frequencies in this region are comparable to long bones by
immunophenotype and repopulating ability (Supplementary Fig. 2c, d).
Because the thickness of long bones precludes analysis by intravital
microscopy, we cannot confirm that our results extend to them,
particularly the poorly trabeculated diaphysial regions.

Combined confocal and two-photon microscopy permitted
visualization to a depth of ~150 µm or 40–60% of the bone marrow cavity
in over 75% of our measurements. Simultaneous detection of bone,
osteoblasts, vasculature and labelled HSPCs was achieved by second
harmonic generation (SHG) microscopy (see Methods),
osteoblast-restricted collagen 1α promoter (Col2.3-GFP) reporter mice,
non-targeted quantum dots (Qdot 655 or 800) injected immediately before
imaging, and flow cytometrically sorted cells stained with the
lipophilic cyanine dyes Vybrant DiD
(1,1'-dioctadecyl-3,3,3',3'-tetramethylindodicarbocyanine perchlorate)
or DiI (1,1'-dilinoleyl-3,3,3',3'-tetramethylindocarbocyanine
perchlorate) (Invitrogen), respectively
(Fig. 1d–h, Supplementary Fig. 1 and Supplementary Movie
1). Multiple combinations of cell surface markers were used to yield
highly enriched long-term reconstituting HSC populations
(LT-HSCs). LKS (that is, lineage^low (Lin^low), c-Kit (Kit)+ and Sca1+)
CD34− Flk2−, LKS CD150+ CD48− and LKS CD48− Flk2− cells were
isolated, and represent partially overlapping populations
(Supplementary Fig. 7). Given that all flow cytometric methods of cell
isolation can only enrich for HSCs, we will refer to all the populations used
as haematopoietic stem/progenitor cells (HSPCs) to account for any 
potential progenitor cell contamination. The isolated cells were not 
adversely affected by DiD staining based on competitive and serial 
reconstitution assays of irradiated hosts (Supplementary Fig. 2a, b). 
Owing to a lack of reporters reliably marking endogenous HSPCs, 
transplant models were used to track the interactions of HSPCs with 
the microanatomy of the calvarium. 

Z-stacks and three-dimensional reconstructions revealed that 
green fluorescent protein (GFP)-positive osteoblasts were large, 
irregular and relatively flat (thickness ~10 µm) cells discontinuously
distributed along the endosteal surface (Fig. 1d–g). Osteoblasts were
adjacent or in close proximity to vasculature (>60% within 10 µm
and >90% within 20 µm of vasculature) (Fig. 1e). Whereas the
middle of the cavity contained vasculature that was not associated with
osteoblasts, the endosteal region demonstrated close co-localization 
of osteoblasts and vessels, suggesting that osteoblast-associated 
HSPCs would probably also be susceptible to paracrine influence 
exerted by vascular/perivascular cells. 

The efficiency of three-dimensional in vivo imaging of HSPCs was 
compared with immunohistochemistry by injecting 375,000 to 6 
x 10° Lin'°” DiD-labelled cells and recording the number of cells 
observed per in vivo imaged field or per 20 µm fixed, decalcified
frozen section. In vivo imaging was consistently more sensitive
(Supplementary Fig. 3b). To define the boundaries of our in vivo
detection system, we performed dilution experiments and noted a
linear correlation between the number of DiD+ imaged cells and the
number of LKS CD34− Flk2− cells injected (Supplementary Fig. 3c).
The numbers of cells detected when injecting 5-15 X 10° LT-HSC- 
enriched populations were small (3–15 cells per recipient) but
consistent, in keeping with the small fraction of the recipients' total bone
marrow represented by calvarium and the inefficiencies of HSPC
homing.

Figure 1 | The calvarium endosteal niche is perivascular. a, Anti-CD31
immunofluorescence (red) reveals a complex vascular network in the
trabecular region of the tibia and along diaphysis endosteal surface (blue:
4',6-diamidino-2-phenylindole (DAPI) nuclear counter-stain). The right
panel shows the boxed area at higher magnification and the dashed line
highlights the endosteal surface. Scale bars: left, 200 µm; right, 100 µm. BM,
bone marrow. b, Intravital microscopy was used to scan the region of mouse
calvarium containing bone marrow (asterisks) within the frontal bones. The
large box shows the entire scanned area; the small box (arrowhead) shows
one approximate observed field (330 × 330 µm). Scale bar: 1 mm. c, DAPI-stained
calvarium coronal section indicating cavity morphology (asterisks mark the
bone marrow) and depth of scanning (bracket). d, xy collage image of bone
(blue), osteoblasts (green) and vasculature (red) acquired simultaneously
with two-photon microscopy. (See also Supplementary Movie 1.) e, f,
Three-dimensional reconstructions of Z-stack with all three components (e) or
bone and osteoblasts only (f). g, Diagram representing measured distance
between all osteoblast pixels and the closest vasculature (n = 4 cavities; error
bars indicate s.d.). h, Rotating three-dimensional model of osteoblasts and
vasculature. Consistent results were seen with ten mice. Scale bars for
d, f, h indicate 50 µm.

HSPCs were detected in the bone marrow cavity as early as 20 min
after injection and were static throughout each 1.5- to 3-h-long
imaging session. In agreement with a previous study, cells were
dispersed across the imaged region (Supplementary Fig. 4). To assess
precise anatomical relationships in the niche, we acquired Z-stack 
images containing the best focal plane for the DiD signal and the focal 
plane of the closest endosteal surface (Supplementary Fig. 5). Only 
HSPCs that were within 60 µm of an endosteal surface were further
analysed because beyond that distance we could not distinguish 
between cells located in the central cavity and those adjacent to dee- 
per endosteum beyond our detection range (see Supplementary 
Movie 1). This approach excluded only 13% of all imaged cells, 
mostly detected at depths where SHG, GFP and Qdot signals were 
too faint to be collected reliably. 

Figure 2 | Engrafting HSPCs reach the endosteum. a, b, LKS CD48− Flk2−
cells injected into non-irradiated (a, n = 3 mice) or irradiated recipients
(b, n = 3 mice) were imaged within 5 h of transplantation. Two-photon
microscopy was used to detect collagen bone SHG (blue) and confocal
microscopy was used to detect DiD signal (white), Qdot vascular dye (red)
and GFP-positive osteoblasts (green) in all panels apart from bottom right,
where all signals were acquired with two-photon microscopy. Arrows point
to single homed HSPCs and the numbers are the xy distance measured from
each cell to the closest endosteum (edge of blue signal). Scale bar: 50 µm.
c, The shortest three-dimensional HSPC–endosteal surface distance was
plotted for each cell imaged in non-irradiated, irradiated and WWv
recipients. The average distance of HSPCs from the endosteum in irradiated
and WWv recipients (n = 3 mice) was significantly less than in
non-irradiated recipients. Red lines represent the mean of all measurements for
each set of experiments.

We adapted the established lodgement assay to observe HSPC
localization within the bone marrow of non-irradiated recipients
(Fig. 2a). All cells were detected in close proximity to vasculature
(0–16 µm) and at variable distances from the endosteum, but never
closer than 30 µm (Fig. 2c). Two weeks after injection, we found single,
bright, DiD-positive cells located 15–40 µm from the
endosteum (Supplementary Fig. 6a). As expected, no contribution of
injected HSPCs to the host peripheral blood or bone marrow was
observed for up to 4 months after transplant (data not shown; see also
ref. 22). HSPCs injected into non-irradiated hosts localized farther
than 30 µm from bone and did not engraft. The lack of engraftment of
transplanted HSPCs in physiological niches imposes an inherent
limitation on current studies that aim to characterize functional HSCs in
their physiological niches in vivo.

To visualize HSPCs that could engraft, lethally irradiated wild-type 
and Col2.3-GFP recipients received HSPC transplants and were 
imaged 20 min to 5 h after injection. Irradiation compromised the
vasculature: the quantum dot fluorescent signal no longer clearly 
demarcated bone-marrow vessels but instead spilled over the entire 
cavity (Fig. 2b, red), revealing progressive damage (Fig. 3a, b). 
Notably, circulating cells followed narrow and well defined traject- 
ories within the quantum dot signal (data not shown), indicating 
that, after irradiation, bone-marrow vasculature becomes permeable 
to small particles such as quantum dots but not blood cells. 

HSPCs injected into irradiated recipients also localized at variable 
distances from the endosteum, but under these conditions 47% of the 
cells were within 15 µm of the endosteal surface (Fig. 2c). We
monitored the mice for up to 6 months and observed at least 25%
peripheral blood multilineage engraftment in all recipients, thus
confirming the functional capacity of the injected cells. 

Figure 3 | Engraftment is initiated by asynchronous HSPC cell divisions.
a, HSPC progeny were imaged 1 day after injection in irradiated recipients
(n = 4 mice), revealing heterogeneity in cell clustering. Blue, bone; red,
vasculature; green, osteoblasts; white, HSPC progeny. b, Cells were tracked
from day 0 to day 2 (n = 2 recipient mice) or from day 1 to day 2 (n = 3
recipient mice) and diverse kinetics of cell division were observed.
c, Increasing numbers of clusters containing 2 or ≥3 cells were observed in
the days after injection (n = 4; error bars indicate s.e.m.). d, When 50% of
cells were stained with DiD and 50% with DiI before injection, only
single-colour clusters were observed. Red, DiD; green, autofluorescence. Arrows
point at each DiD-positive cell within two clusters; arrowheads point at
autofluorescent cells. Cells accepted for assessment had a
dye/autofluorescence signal ratio >2 (in this example 2.82, 3.71, 4.12, 8.22). The
same analysis was used to validate DiI signals. Scale bars in a, b, d are 50 µm.
e, Summary of observed cell clusters in three independent experiments.

WWv mice carry two mutations in the stem-cell-factor receptor
c-Kit, resulting in an impaired ability of endogenous HSCs to engage
their niche. As expected, HSPCs injected in non-irradiated WWv
mice engraft and overtake endogenous HSC production of all
peripheral blood lineages (Supplementary Fig. 6d). Whereas WWv
mice show extramedullary haematopoiesis, secondary bone-marrow
transplants confirmed that the injected wild-type HSPCs reside in
functional, supportive bone-marrow niches (Supplementary Fig. 6e).
Even in the absence of radiation-damaged vasculature, wild-type
HSPCs lodged closer to bone surfaces in WWv animals than in
wild-type non-irradiated mice (Fig. 2c). Therefore, multiple settings 
in which donor cells engraft the bone marrow are associated with 
HSPCs in the bone marrow in close proximity to endosteum. 

Engrafting HSCs proliferate within the first 24 h after
transplantation and their resulting progeny pass through the multipotent and
early progenitor states within 4 days, but great heterogeneity in the
engraftment features of stem cell populations has been previously
reported. Using our system, the xyz position of individual cells
and their immediate progeny can be tracked in vivo over time 
(Fig. 3a, b), revealing the progressive appearance of cell clusters 
(Fig. 3c) of decreased dye intensity, consistent with the partitioning 
of DiD label on cell division (Fig. 3b). To test whether cell clusters 
arose from an accumulation of migrating cells, we mixed LT-HSC- 
enriched populations stained with DiD or a second lipophilic dye, 
DiI, and imaged recipient mice 2 and 3 days after injection. Only a
single dye was observed in cells within each cluster (Fig. 3d, e),
consistent with the proliferation of single cells associated with early
engraftment. Moreover, we observed 5-bromodeoxyuridine (BrdU) 
uptake by the injected cells using both immunofluorescence and 
fluorescence-activated cell sorting (FACS) analysis (Supplementary 
Fig. 9). Our analysis revealed heterogeneity of cell division patterns at 
the single cell level in vivo. 

We sought to determine whether positioning within the
bone-marrow microenvironment was influenced by intrinsic features of
the haematopoietic cells such as differentiation state. We imaged
LT-HSC-enriched (LKS CD34− Flk2−), multipotent-progenitor-enriched
(MPP; LKS CD34+ Flk2+) and committed-progenitor-enriched
(Lin^low Kit+ Sca−) populations and the heterogeneous Lin^low cells
within hours after transplantation (Fig. 4a, b). LT-HSC-enriched cells
localized closest to endosteum and osteoblasts, with more mature
subsets residing progressively farther away. To determine whether cell
position correlated with cell division, we compared the position of cells
that had proliferated creating clusters of ≥3 cells with those that did
not divide. The quiescent cells were significantly closer to osteoblasts
(Fig. 4d).

To determine whether niche variables could influence HSPC 
localization, we injected cells in wild-type or PPR mice (that is, mice 
transgenic for a constitutively active parathyroid hormone/parathy- 
roid hormone related peptide receptor driven by the osteoblast 
specific promoter, Col12.3kb). PPR mice have increased osteoblast
number, trabecular bone volume and HSCs, and have been shown to
drive expansion of injected HSCs when recipients of bone-marrow
transplantation. Two days after injection, DiD+ cells in PPR mice
were markedly closer to the endosteum even when correcting for the 
reduced bone-marrow cavity size (Fig. 4e and data not shown). 
Therefore, stem cell non-autonomous features of the niche contri- 
bute to the regulation of HSPC localization. 

The data presented here demonstrate that the previously proposed 
dichotomy between distinct osteoblast and perivascular niches is not 
anatomically feasible in the calvarium: osteoblasts are perivascular. 
We cannot exclude perivascular-only niches within the diaphysis of 
long bones or in non-bony tissues such as spleen; however, the 
microarchitecture of trabeculae indicates a joint periendosteal–perivascular
niche. The relationship of transplanted primitive haematopoietic
cells to anatomical components of the trabecular niche is
influenced by cell-intrinsic and niche-intrinsic variables. Cells of 
differing immunophenotype localized differently, with closer prox- 
imity to bone corresponding to a greater enrichment for stem cell 
function. In addition, conditions in the microenvironment that 
enable stem cell engraftment or in which osteoblasts drive HSC 
expansion yield closer association of HSPCs with bone. There is 
anatomical dynamism within the bone marrow, with cell positioning 
reflecting differing physiological demands on stem cells. However, 
we cannot conclude that HSPC engraftment or expansion require 
direct osteoblast contact, as this was not uniformly observed. Rather,
approximate vicinity may be sufficient for osteoblast alteration of
stem cell function. Gradients of secreted factors and extracellular
matrix, or events induced in other surrounding cell types, may
contribute to the regulatory function of osteoblasts in the niche.

We have visualized single transplanted HSPCs in their niche
within a living mammal and demonstrated that the state of both
haematopoietic cell and microenvironment affects the physical
association between components of haematopoietic tissue. Continued
technical developments and the generation of more sophisticated
molecularly modified animal models will permit further analysis of
physiologically relevant settings (other than transplantation or
artificially permissive environments) and of other stem cells of normal
and malignant types as they engage and perhaps compete for niche
elements. The combination of molecular, cellular and organismal
biology may thereby be simultaneously examined to understand
the regulatory networks affecting stem cell control.

Figure 4 | Cell-dependent and niche-dependent HSPC localization. a, b, LKS
CD34− Flk2− (n = 6 recipients) LT-HSC-enriched, LKS CD34+ Flk2+ MPP
(n = 3 recipients), Lin^low Kit+ Sca− progenitors (n = 3 recipients) and Lin^low
(n = 4 recipients) populations home to distinct locations, closer to or further
from endosteum (a) and osteoblasts (b). All imaging was performed within
5 h of transplantation. c, Representative image of a LKS CD34− Flk2− cell
residing adjacent to an osteoblast and close to endosteum 4 h after injection.
White, DiD-labelled cell; green, GFP+ osteoblasts; red, Qdot vascular dye;
blue, bone collagen. Scale bar: 50 µm. d, Independently of the
LT-HSC-enriched starting population, the distance between DiD-labelled cells and
osteoblasts measured 2 days after injection increased with the number of
cells found in each observed cluster (n = 5 imaged mice). e, Two days after
injecting the same number of LKS cells in irradiated wild-type or PPR mice
(n = 2 + 2) similar numbers of cells were observed, but LKS progeny in PPR
mice were preferentially located adjacent to the endosteal surface. Red lines
indicate the mean of all measurements in each set of experiments.


METHODS SUMMARY

All mice were housed according to IACUC guidelines and used for experiments
when 8–14 weeks old. Wild-type C57BL/6.SJL mice were HSPC donors when
recipients were wild type, Col2.3-GFP or WWv double mutant (backcrossed to
C57BL/6 background). FVB mice were donors for wild-type or PPR littermate
mice (gift from E. Schipani).

Mice were anaesthetized and prepared for in vivo imaging as described.
Immediately before imaging 20 µl of non-targeted Qdot 800 or 655
(Invitrogen) diluted in 130 µl sterile PBS was injected retro-orbitally to allow
vasculature visualization. The mouse was held in a heated tube mounted on a
precision 3 axis motorized stage (Suter MP385). All mice were imaged with a
custom-built confocal two-photon hybrid microscope specifically designed for
live animal imaging (see Methods). At the start of each imaging session, we
surveyed large areas of the skull bone surface using video rate second harmonic
microscopy (see Methods) to identify the major anatomical landmarks such as
sagittal and coronal sutures. We identified the locations of HSPCs within
bone-marrow cavities and recorded their coordinates relative to the intersection of the
sagittal and coronal sutures. SHG and GFP signals above each identified HSPC
were acquired every 5 to 20 µm until the above endosteal surface was reached.
After in vivo imaging, the scalp was re-closed using 3M Vetbond veterinary glue
and post-operative care was provided as described.

Images were coloured and merged using Adobe Photoshop and HSPC- 
microenvironment distance measures were obtained using Adobe Illustrator 
and Microsoft Excel. A two-tailed type 2 t-test was applied to all data. P
values ≤0.05 were considered statistically significant.
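
In Microsoft Excel, a "type 2" t-test is the two-sample test assuming equal variances, so the comparison described above corresponds to a pooled-variance Student's t-test. The sketch below is a minimal illustration under that reading, not the authors' spreadsheet: the function names are hypothetical, the two-tailed P value is obtained by numerically integrating the t density with Simpson's rule, and the example distances are invented.

```python
import math


def t_pdf(t, df):
    """Probability density of Student's t distribution with df degrees of freedom."""
    c = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return c * (1 + t * t / df) ** (-(df + 1) / 2)


def pooled_t_test(a, b, steps=2000):
    """Two-tailed, equal-variance two-sample t-test.

    Returns (t, df, p). `steps` must be even (Simpson's rule).
    """
    na, nb = len(a), len(b)
    mean_a, mean_b = sum(a) / na, sum(b) / nb
    ss_a = sum((v - mean_a) ** 2 for v in a)
    ss_b = sum((v - mean_b) ** 2 for v in b)
    df = na + nb - 2
    pooled_var = (ss_a + ss_b) / df
    t = (mean_a - mean_b) / math.sqrt(pooled_var * (1 / na + 1 / nb))
    # Area under the density between 0 and |t|, by Simpson's rule.
    upper = abs(t)
    h = upper / steps
    area = t_pdf(0.0, df) + t_pdf(upper, df)
    for k in range(1, steps):
        area += (4 if k % 2 else 2) * t_pdf(k * h, df)
    area *= h / 3
    p = max(0.0, 1.0 - 2.0 * area)
    return t, df, p


# Hypothetical HSPC-to-endosteum distances in micrometres (not study data).
non_irradiated = [34.0, 41.0, 52.0, 38.0]
irradiated = [9.0, 14.0, 22.0, 12.0]
t_stat, dof, p_value = pooled_t_test(non_irradiated, irradiated)
significant = p_value <= 0.05
```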


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS

Microscopy. Second harmonic signal is generated by collagen in the bone when 
it is illuminated by femtosecond titanium:sapphire laser pulses. The exact
wavelength for SHG was not critical as it is a scattering rather than an absorption 
process. GFP-expressing osteoblasts were imaged by either two-photon (920 nm 
excitation, 500-590 nm detection) or confocal microscopy (491 nm excitation, 
505-590 nm detection). Circulating Qdot 655 was imaged with two-photon 
microscopy (920 nm excitation, 625-675 nm detection). Qdot 800 was imaged 
with confocal microscopy (532 nm excitation, >795 nm detection).
DiD-labelled cells were identified and distinguished from autofluorescent cells by
taking two confocal images at 633 nm (650–760 nm detection) and at 532 nm
(560–640 nm detection), respectively. The opposite settings were used to identify
DiI-labelled cells.
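
The two-image comparison can be expressed as a simple ratio test. The sketch below assumes the acceptance criterion quoted in the legend of Fig. 3 (dye/autofluorescence signal ratio >2); the function name and the raw intensity values are hypothetical, chosen only so that the ratios match the four examples given in that legend.

```python
def is_dye_labelled(dye_signal, autofluorescence_signal, min_ratio=2.0):
    """Accept a cell as dye-labelled when its dye-channel signal exceeds
    the autofluorescence-channel signal by more than `min_ratio`."""
    if autofluorescence_signal <= 0:
        raise ValueError("autofluorescence signal must be positive")
    return dye_signal / autofluorescence_signal > min_ratio


# Hypothetical (dye, autofluorescence) intensity pairs in arbitrary units.
cells = [(282, 100), (371, 100), (412, 100), (822, 100), (150, 100)]
accepted = [pair for pair in cells if is_dye_labelled(*pair)]
```

For DiI-labelled cells the same function applies with the two channels swapped, in line with the "opposite settings" described above.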

A ×30 0.9 NA water immersion objective (Lomo) was used for all imaging, so
that each field observed always corresponded to 330 × 330 µm. To minimize
chromatic aberration owing to the wide range of excitation wavelengths used,
the collimation of each excitation laser beam was adjusted with a telescope to
achieve parfocality. For three-dimensional analysis of bone-marrow cavity,
Z-stacks were acquired at 1–3 µm steps. A PCI-based image capture board
(Snapper24, Active Silicon) was used to acquire up to three channels
simultaneously using iPhoton software running under Mac OS X that was developed
in-house.
Image processing. Each channel was acquired in 8-bit greyscale and merged to 
an RGB image using iPhoton or Photoshop (Qdot, red; GFP, green; collagen, 
blue). The fourth channel (DiD) was given a white colour code and merged with 
the RGB image. The intensity for each channel was scaled so that 0-255 in the 
8-bit image corresponded to the range from the noise floor to Imax in the
original image, where Imax was the maximum pixel value.
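
The intensity scaling described above amounts to a linear stretch between the noise floor and the maximum pixel value. The following sketch is one plausible reading, not the iPhoton implementation: the function name is hypothetical, and clipping values that fall outside the range is an assumption the text does not state.

```python
def scale_to_8bit(pixels, noise_floor, i_max):
    """Map raw pixel values linearly so that noise_floor -> 0 and i_max -> 255.

    Values outside [noise_floor, i_max] are clipped (an assumption).
    """
    if i_max <= noise_floor:
        raise ValueError("i_max must exceed the noise floor")
    span = i_max - noise_floor
    scaled = []
    for value in pixels:
        clipped = min(max(value, noise_floor), i_max)
        scaled.append(round(255 * (clipped - noise_floor) / span))
    return scaled


# Hypothetical raw values with a noise floor of 100 and a maximum of 1100.
channel = scale_to_8bit([100, 350, 1100, 50, 2000], noise_floor=100, i_max=1100)
```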

Two-dimensional measurements were performed by drawing and measuring 
lines using Adobe Illustrator and importing images at 500 × 500 point size.
Points-to-micrometres conversion and all data analysis was performed using 
Microsoft Excel software. 
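
The points-to-micrometres conversion can be illustrated as follows. This sketch assumes that each imported 500 × 500 point image spans exactly one 330 × 330 µm field, which gives 0.66 µm per point; the constant and function names are hypothetical, and the original calculation was done in a spreadsheet rather than in code.

```python
import math

FIELD_WIDTH_UM = 330.0   # width of one imaged field, from the Methods
IMAGE_WIDTH_PT = 500.0   # imported image width in points (assumed to span the field)


def points_to_um(length_pt):
    """Convert a length measured in points to micrometres."""
    return length_pt * FIELD_WIDTH_UM / IMAGE_WIDTH_PT


def line_length_um(start_pt, end_pt):
    """Length in micrometres of a line drawn between two (x, y) points."""
    dx = end_pt[0] - start_pt[0]
    dy = end_pt[1] - start_pt[1]
    return points_to_um(math.hypot(dx, dy))


# A hypothetical line from a cell to the nearest endosteal edge.
distance_um = line_length_um((120.0, 80.0), (150.0, 120.0))
```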

Three-dimensional rendering was obtained using ImageJ 1.38 (open source 
NIH software, http://rsb.info.nih.gov/ij/index.html). 8-bit stacks from each 
channel were given a threshold to eliminate hue differences within structures 
and assigned an arbitrary intensity (highest to bone, lowest to vasculature and 
intermediates to HSCs and osteoblasts) and combined in opaque models. Bright 
and dark outlier pixels were eliminated to reduce noise and each structure was 
assigned blue/red/green/white colour from the LUT table. The VolumeJ plugin
was used for 'cine-mode' rendering with the following settings: classifier ramp +
index, scaling = 1, aspect ratio 1:1:1 or 1:1:3 depending on the Z-stack step, light
0, 0, −1,000.

To measure distances between osteoblasts and vasculature, multiple image 
stacks from the same bone cavity were concatenated based on coordinate 
information. The resultant stack was imported into ImageJ, re-sliced into the 
vertical planes (Y–Z, X–Z) and digitally rotated to remove skull tilt relative to
imaging plane. The stack channels were separated, and the greyscale intensity 
profile along the Z direction of the bone was measured and fit into an exponen- 
tial decay curve as a function of depth, the inverse of which was then multiplied 
to the bone (blue) channel. A similar function was applied to the osteoblast 
(green) and the vascular (red) channels using an empirically derived exponent. 
A multiplicative constant was incorporated to equalize channel brightness. 
Euclidean distance maps (EDM), codifying the minimal pixel distance between 
each non-vascularized point to the vessel walls as a greyscale value, were gener- 
ated in ImageJ based on the binarized vascular signal for the three Cartesian 
planes (X–Y, Y–Z and X–Z); the final three-dimensional EDM stack was created 
after re-slicing the two vertical (YZ, XZ) EDMs in the XY orientation and assign- 
ing to each pixel the smallest greyscale value out of the three planar EDMs. The 
binary signal of the cells was inverted and the larger of the greyscale values 
between this stack and the vascular EDM was selected for each pixel; the resultant 
stack thus codified the distance from all points of the cells to the vessel wall with a 
consistent background value of 255. The histogram of these stacks constituted 
the distance distribution data. 
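
The combination steps described above (smallest value across the three planar EDMs, then the larger of that value and the inverted cell signal, then a histogram) can be sketched as follows. This is an illustration only: it assumes the planar EDMs have already been computed and re-sliced into the same orientation, it represents each stack as a flat list of voxel values, and all function names are hypothetical rather than ImageJ commands.

```python
from collections import Counter

BACKGROUND = 255  # consistent background value named in the text


def combine_planar_edms(edm_xy, edm_yz, edm_xz):
    """Per voxel, keep the smallest greyscale distance of the three planar EDMs."""
    return [min(a, b, c) for a, b, c in zip(edm_xy, edm_yz, edm_xz)]


def cell_distance_stack(cell_mask, edm_3d):
    """Per voxel, keep the larger of the inverted cell signal and the EDM.

    Inverting the binary cell signal gives 0 inside cells and 255 elsewhere,
    so cell voxels keep their distance and all other voxels become 255.
    """
    inverted = [0 if is_cell else BACKGROUND for is_cell in cell_mask]
    return [max(i, d) for i, d in zip(inverted, edm_3d)]


def distance_histogram(stack):
    """Count voxels per distance value, ignoring the background value."""
    return Counter(v for v in stack if v != BACKGROUND)


# Hypothetical four-voxel example.
edm = combine_planar_edms([3, 5, 9, 2], [4, 1, 9, 7], [6, 2, 8, 2])
stack = cell_distance_stack([True, False, True, True], edm)
print(edm, stack, dict(distance_histogram(stack)))
```

One consequence of using 255 as the background is that a cell voxel whose distance saturates at 255 cannot be told apart from background in the histogram.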

FACS analysis. To analyse LT-HSC subpopulation overlap, whole bone-marrow 
mononuclear cells were stained with c-Kit allophycocyanin, Flk2 phycoerythrin 
(PE), Lineage cocktail biotin (B220, Mac1, Ter119, CD3, CD4, CD8 at 
1:1:1:1:1:1) followed by streptavidin PerCP (BD Pharmingen), Sca-1 PE-Cy5.5 
(Caltag), CD34 fluorescein isothiocyanate (FITC) (eBioscience), CD48 Pacific 
blue and CD150 PE-Cy7 (BioLegend). At least 2 X 10° events were acquired 
using a BD LSRII flow cytometer. Compensation and data analysis were 
performed using FlowJo 8.5.3. BrdU was detected using the BD BrdU flow detection 
kit on either whole bone-marrow or lineage-depleted cells. 

HSPC isolation, staining and injection. Whole bone-marrow cells were col- 
lected by crushing tibias, femurs, hips and spine of donor mice, stained with 
biotin-conjugated lineage cocktail antibodies and subjected to lineage depletion 
using Miltenyi magnetic beads and columns. The resulting lineage-depleted 
fraction was stained with the antibodies listed above and sorted using FACS 
DiVa or FACS ARIA (Becton Dickinson). A small fraction of the collected cells 
was re-run through the sorter and over 95% purity was consistently confirmed. 

HSPCs (1-5 X 10° cells per ml) were stained with 5 µM DiD or 7.5 µM DiI in 
PBS without serum for 10 min at 37 °C, washed once in PBS and immediately 
injected into the tail vein of recipient mice. Unless stated differently, each imaged 
CD45.2 mouse received 8,000 to 15,000 labelled CD45.1 HSPCs together with 
3-5 X 10° CD45.2 supportive whole bone-marrow mononuclear cells to ensure 
survival. 
Bone-marrow transplantation. Unless otherwise stated, recipient mice were 
lethally irradiated with 9.5 Gy the night before transplantation. To evaluate 
DiD toxicity, 10° CD45.1 whole bone-marrow mononuclear cells stained or 
unstained with DiD were injected together with 10° CD45.2 unstained whole 
bone-marrow mononuclear cells into CD45.2 recipients (five mice in each 
group). For serial transplantation, 30,000 DiD-stained CD45.2 LKS 
CD48 Flk2⁻ LT-HSCs were injected into two CD45.1 recipients. After 6 weeks, 
primary recipients’ whole bone-marrow mononuclear cells were collected and 
pooled. Then 2 X 10° cells were injected into each of five secondary CD45.1 recipients. 
Recipients’ peripheral blood was assessed for CD45.1/CD45.2 chimaerism at 3, 
12, 16 and 27 weeks after transplant using PE- and FITC-conjugated antibodies, 
respectively (BD Pharmingen). Peripheral blood chimaerism was also monitored 
in all imaged mice. 

Calvarium and femur whole bone-marrow long-term reconstitution ability 
was compared by competitive transplant of 10° CD45.1 calvarium or femur 
whole bone-marrow mononuclear cells together with 10° CD45.2 femur whole 
bone-marrow mononuclear cells into CD45.2 recipients. Recipients’ peripheral 
blood chimaerism was assessed 4, 8, 12 and 16 weeks after transplantation. 

The ability of HSCs to engraft fully in the bone marrow of WWv mice was 
checked by serial transplantation. A total of 10,000 LT-HSC-enriched cell popu- 
lations were injected in non-irradiated WWv recipients and 6 months later 
2 X 10° whole bone-marrow mononuclear cells from reconstituted femurs were 
injected into separate CD45.2 recipients. Peripheral blood chimaerism was 
assessed 12 weeks later. 

BrdU administration. Wild-type CD45.1 mice transplanted with wild-type 
CD45.2 LKS CD34 Flk2⁻ or Rosa26 rtTA × TRE H2BGFP⁺ LKS 
CD150⁺CD48⁻ cells received 100 µl BrdU solution (10 mg ml⁻¹) via 
intraperitoneal injection every 12 h from 12 h after transplant. Negative control mice 
were injected with saline. Two days after transplantation bones were harvested 
and either crushed for flow cytometry detection of BrdU uptake or processed for 
immunofluorescence. 

Immunofluorescence. Frozen sections from paraformaldehyde-fixed and 
decalcified tibias from wild-type mice were stained with anti-CD31 biotin-conjugated 
antibody (BD Pharmingen) using the Perkin Elmer Tyramide Kit following the 
manufacturer’s instructions. Streptavidin HRP and tyramide TMR were used to 
visualize CD31 localization. Vectashield (VectorLabs) containing DAPI nuclear 
counter-stain was used to mount the sections. Images were acquired with a 
Nikon Eclipse 80i epifluorescence microscope equipped with a Qimaging 
Micropublisher digital CCD colour camera. BrdU staining was performed using 
the BD BrdU in situ detection kit according to the manufacturer’s instructions 
and terminating with Perkin Elmer TMR tyramide amplification. Rabbit poly- 
clonal anti-GFP antibody (Invitrogen) was incubated at the same time as the 
BrdU antibody and donkey anti-rabbit Alexa 647 conjugated antibody 
(Invitrogen) was incubated at the same time as streptavidin HRP. Images were 
acquired with a Bio-Rad MRC600 confocal microscope. 

Detection of functional haematopoietic stem cell 
niche using real-time imaging 

Yucai Xie, Tong Yin, Winfried Wiegraebe, Xi C. He, Diana Miller, Danny Stark, Katherine Perko, 
Richard Alexander, Joel Schwartz, Justin C. Grindley, Jungeun Park, Jeff S. Haug, Joshua P. Wunderlich, 
Hua Li, Simon Zhang, Teri Johnson, Ricardo A. Feldman & Linheng Li 


Haematopoietic stem cell (HSC) niches, although proposed decades 
ago, have only recently been identified as separate osteoblastic and 
vascular microenvironments. Their interrelationships and interactions 
with HSCs in vivo remain largely unknown. Here we report 
the use of a newly developed ex vivo real-time imaging technology 
and immunoassaying to trace the homing of purified 
green-fluorescent-protein-expressing (GFP⁺) HSCs. We found that 
transplanted HSCs tended to home to the endosteum (an inner bone 
surface) in irradiated mice, but were randomly distributed and 
unstable in non-irradiated mice. Moreover, GFP⁺ HSCs were more 
frequently detected in the trabecular bone area than in the 
compact bone area, and this was validated by live imaging 
bioluminescence driven by the stem-cell-leukaemia (Scl) 
promoter–enhancer. HSCs home to bone marrow through the vascular 
system. We found that the endosteum is well vascularized and 
that vasculature is frequently localized near N-cadherin⁺ 
pre-osteoblastic cells, a known niche component. By monitoring 
individual HSC behaviour using real-time imaging, we found that a 
portion of the homed HSCs underwent active division in the 
irradiated mice, coinciding with their expansion as measured by flow 
assay. Thus, in contrast to central marrow, the endosteum formed a 
special zone, which normally maintains HSCs but promotes their 
expansion in response to bone marrow damage. 

To study the functional interaction between the HSC and its niche, 
we developed a method of ex vivo imaging stem cells (EVISC) (Fig. 1a 
and Supplementary Fig. 1). This method combines real-time imaging 
technology (Supplementary Movie 1) and the ability of HSCs to 
home to their niche after transplantation into irradiated mice 
(Supplementary Methods). To identify and track the transplanted 
HSCs, we isolated Flk2⁻ Lin⁻ Sca-1⁺ c-Kit⁺ (Flk2⁻ LSK) cells, which 
are enriched for HSCs, from actin-driven GFP-transgenic 
mice (Supplementary Fig. 1a, b). We compared the behaviour of 
these cells after transplantation into irradiated and non-irradiated 
mice using two-photon microscopy (Fig. 1a, b). It has been 
documented that HSCs migrate from tail vein to bone marrow in a few 
hours. We obtained real-time images of HSC arrival in bone marrow 
around 5–8 h after transplantation. In non-irradiated mice, we 
detected GFP⁺ HSCs in bone marrow (Fig. 1b and Supplementary 
Movie 2). In irradiated mice, however, we more frequently observed 
GFP⁺ HSCs homing to the endosteal region (Fig. 1c and 
Supplementary Movie 3). Our real-time imaging method enabled 
observation of dynamic interactions between HSCs and their niches. 

As the EVISC assay targeted limited areas, we obtained a more 
complete picture of the distribution of the transplanted GFP⁺ HSCs 
by immunostaining multiple longitudinal sections of femurs and 
tibias from three experiments. Positions of GFP⁺ HSCs were 
measured as cell distance (where 1 cell distance (CD) = 6–8 µm) from the 
endosteal surface (Fig. 1d, e). We generated a cumulative percentage 
plot to display distribution patterns under irradiated versus 
non-irradiated conditions (Fig. 1f). In non-irradiated recipients, GFP⁺ HSCs 
accumulated steadily with increasing distance from the endosteum. In 
contrast, in irradiated recipients, the GFP⁺ HSCs accumulated rapidly 
close to the endosteum, but more gradually from 3 CD outwards. 
Two-sided Fisher’s exact test confirmed that the cells were 
significantly more likely to be located within ≤2 CD under irradiated than 
non-irradiated conditions (58.1% versus 12.5% respectively, 
P = 0.009, Fig. 1g and Supplementary Table 1). Cell distribution 
patterns 3 CD from the endosteum and beyond were not significantly 
different between the two conditions (P = 0.9576). In addition, we 
observed substantial reduction in bone marrow cellularity and 
enlarged blood vessels in irradiated bone marrow (Fig. 1d, e). Thus, 
after bone marrow damage, HSCs showed biased homing favouring 
the ‘endosteal zone’ (≤2 CD) rather than central marrow (≥3 CD). 
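
For readers unfamiliar with the test named above, the following is a minimal sketch of a two-sided Fisher's exact test on a 2 × 2 table, using the common convention of summing the probabilities of all tables that are no more likely than the observed one. The per-condition cell counts are in Supplementary Table 1 and are not reproduced here, so the example uses a textbook table rather than data from this study; the function name is an illustrative assumption.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2 x 2 table [[a, b], [c, d]].

    Rows could be conditions (for example irradiated, non-irradiated) and
    columns zones (for example endosteal, central); margins are held fixed.
    """
    row1, row2, col1 = a + b, c + d, a + c
    total = row1 + row2

    def prob(x):
        # Hypergeometric probability of x in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / comb(total, col1)

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Small relative tolerance so ties with the observed table are included.
    p_value = sum(
        prob(x)
        for x in range(lowest, highest + 1)
        if prob(x) <= p_observed * (1 + 1e-9)
    )
    return min(1.0, p_value)


# Textbook 2 x 2 table (not data from this study).
print(fisher_exact_two_sided(3, 1, 1, 3))
```

Because the test conditions on the row and column totals, it remains valid for the small cell counts typical of this kind of positional scoring.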

We also observed that GFP⁺ HSCs homed predominantly to the 
trabecular bone area (83.8%) compared with the compact bone area 
(16.2%) (P = 0.001) under irradiated conditions, but showed no 
preference for either trabecular or compact bone area (42.3% 
compared with 57.7%) under non-irradiated conditions (Fig. 1h and 
Supplementary Table 2). We wondered what mechanism might 
underlie the biased HSC homing, and found that expression 
of Sdf1 (ref. 15), a key chemotactic factor, was increased (2.7-fold) in 
trabecular bone area in response to irradiation (Fig. 1i). 

To confirm the location of HSCs in live mice, we used Scl-TVA 
transgenic mice in which an avian retrovirus receptor is driven from 
Scl-promoter-3′ enhancer regulatory elements that are active 
predominantly, but not exclusively, in HSCs. When these mice were injected 
with an avian virus containing a luciferase reporter (RCAS-Luc), only 
the Scl-TVA⁺ cells were susceptible to infection, allowing this 
population to be visualized by live-imaging bioluminescence, which reflected 
the luciferase activity (Supplementary Fig. 2). Bioluminescence 
imaging displayed only transient signals in liver and spleen (comparing 
13 days with 4 months in Fig. 1j), reflecting short-term mobilization 
and potential expansion of HSCs in response to bone marrow damage, 
but showed strong and persistent signals (>10 months) in the 
trabecular bone area of both legs and other regions (Fig. 1j, k and 
Supplementary Fig. 2c), reflecting self-renewing HSCs. 

The tendency of HSCs to home to the endosteum suggested 
preferential homing to the osteoblastic niche, which was thought to be 
physically distant from the vascular niche. Therefore, we 
examined the relationship between osteoblastic and vascular structures and 
found that vasculature was distributed throughout bone marrow, 
extending into most endosteal zones (Fig. 2a). 

We next determined to what extent N-cadherin⁺ osteoblastic cells, 
a key niche component, and vasculature were juxtaposed (Fig. 2b–f, 
Supplementary Fig. 3 and Supplementary Table 3). The overall 
distribution of N-cadherin⁺ osteoblastic cells in the femur was revealed by 
immunoassaying (Fig. 2b) and in N-cadherin-LacZ transgenic mice 
(Supplementary Fig. 4). Overall, the coverage rates by vasculature 
were higher (57% of trabecular bone area; 85% of compact bone area) 
than those of N-cadherin⁺ osteoblastic cells (47% of trabecular bone 
area; 45% of compact bone area) (Fig. 2g). This might result in 
different microenvironments: N-cadherin⁺ osteoblastic cells devoid of 
vasculature (13% of trabecular bone area; 6% of compact bone area); 
vasculature and N-cadherin⁻ cells (23% of trabecular bone area; 45% 
of compact bone area); and vasculature plus N-cadherin⁺ osteoblastic 
cells (34% of trabecular bone area; 40% of compact bone area) 
(Fig. 2h). We calculated the proportion of the endosteum expected 
to be covered by both vasculature and N-cadherin⁺ cells (given the 
individual frequencies of these properties and assuming independent 
distribution). We noted that the proportion was higher than expected 
in trabecular bone area (34% versus 29%, P = 4.0 X 10°) but similar 
to expectation in compact bone area (40% versus 38%, P = 0.47) 
(Fig. 2i and Supplementary Table 4), indicating their non-random 
co-association in the trabecular bone area. 

Figure 1 | Detection of HSCs using EVISC, 
immunoassaying and the Scl-TVA animal model. 
Differential interference contrast (DIC) images 
show bone structure. Diamidino-2-phenylindole 
(DAPI) staining is blue. BM, bone marrow; BV, 
blood vessel. a, A diagram showing the process of 
EVISC. b, c, Detection of a GFP⁺ HSC 
(GFP-HSC) using EVISC. d, e, Representative 
images of GFP⁺ HSCs (red arrow) at different 
positions. The average bone marrow cell 
diameter (6–8 µm) was used as one cell distance 
(CD). f, Cumulative percentage plot of GFP⁺ 
HSC numbers at each position away from 
endosteum (d, e). g, h, GFP⁺ HSC distribution in 
endosteal and central marrow zones (g) and in 
trabecular bone area (TBA) and compact bone 
area (CBA) (h). The Fisher’s exact test was used 
for the statistical analysis (Supplementary Tables 
1 and 2). CM, central marrow. i, Messenger RNA 
level of Sdf1 (also known as Cxcl12) in 
bone-marrow-depleted trabecular bone area (n = 3, 
error bars represent standard deviation, s.d.). 
j, k, Live bioluminescence imaging of Scl-TVA 
animals injected with RCAS-Luc virus; ventral 
(j) and dorsal (k) views are shown. 

Osteoblasts have been viewed as a key component of the HSC 
niche; however, there are different types: pre-osteoblastic cells 
(stroma-derived (Supplementary Movie 4) and normally inactive 
but which can be activated to give rise to osteoblasts) and osteoblasts 
(precursors for osteocytes, a component of the bone structure). We 
examined cells expressing a known osteoblast marker, osterix (Osx, 
a nuclear factor, also called Sp7), and N-cadherin. Osx⁺ cells revealed 
a typical osteoblast distribution pattern (Fig. 2j). Sections co-stained 
with these two markers showed expression of N-cadherin in some 
Osx⁺ cells and in some Osx⁻ cells that localized either along the bone 
surface or adjacent to Osx⁺ osteoblasts (Fig. 2k). This distribution 
pattern of N-cadherin⁺ cells adjacent to Osx⁺ cells suggested that 
N-cadherin⁺ cells were more probably pre-osteoblastic cells. We also 
validated the relationship between N-cadherin⁺ cells and osteoblasts 
(Fig. 2l, m) through staining sections derived from a Col2.3-GFP 
transgenic mouse in which GFP was driven by the collagen-Iα 2.3 
promoter and specifically expressed in osteoblasts (Fig. 2l). 

The endosteum offers homing or resident HSCs a range of possible 
microenvironments as discussed above. To determine with which 
type of cells GFP⁺ HSCs tended to interact, we co-stained with 
N-cadherin and GFP. GFP⁺ HSCs were observed directly attaching 
to N-cadherin⁺ cells (Fig. 2n, o, and Supplementary Fig. 5a) and were 
close or adjacent to such cells in 10 out of 17 cases (59%; 
Supplementary Fig. 5a, b). This is consistent with our previous 
observation of endogenous HSCs (identified using CD201 (ref. 24)) 
interacting with N-cadherin⁺ osteoblastic cells. Notably, some of 
the GFP⁺ HSCs were also adjacent to N-cadherin⁺ cells in central 
marrow (Supplementary Fig. 5c). However, not all GFP⁺ HSCs were 
close to N-cadherin⁺ cells (Supplementary Fig. 5b), indicating the 
existence of additional N-cadherin⁻ niche components. 

Although the level of N-cadherin measured as fluorescence 
intensity in the GFP⁺ HSCs was much lower (on average ninefold) than in 
adjacent osteoblastic cells (Fig. 2p, q and Supplementary Table 5), 
its asymmetric distribution at the interface between HSCs and 
pre-osteoblastic cells (Fig. 2p and Supplementary Movie 5) validates our 
previous observation. 

HSCs normally maintain a steady number but this can be increased 
in response to bone marrow damage. We compared proliferation of 
homed GFP⁺ HSCs 4–6 h after transplantation using an 
immunoassay with the marker Ki67 (Fig. 3a–d). In the non-irradiated bone 
marrow, clusters of proliferating cells were typically seen in the 
central marrow, and isolated proliferating cells were often seen in 
the endosteum (Supplementary Fig. 6a). In the irradiated host, 
however, proliferating cells were substantially increased in the endosteal 
region (Supplementary Fig. 6b). Furthermore, we found that GFP⁺ 
HSCs were rarely Ki67⁺ in non-irradiated hosts but were five times 
more often Ki67⁺ in irradiated hosts (7% compared with 35%) 
(Fig. 3a–c and Supplementary Table 6). Notably, Ki67⁺GFP⁺ 
HSCs were frequently found in locations adjacent to clusters of 
proliferating cells, and this was particularly obvious under irradiated 
conditions (Fig. 3b). Irradiation-induced changes in the location of 
proliferating cells were also reflected by more Ki67⁺GFP⁺ HSCs but 
fewer Ki67⁻GFP⁺ HSCs being detected in the endosteal compared to 
the central marrow regions (Fig. 3d and Supplementary Table 7). All 
these observations indicate that irradiation induced a dynamic 
change in the endosteal microenvironment from normal inhibition 
to stress-induced stimulation of proliferation, favouring the 
expansion of homed GFP⁺ HSCs. 

Figure 2 | Relationships between osteoblastic 
and vascular structures and between GFP⁺ 
HSCs and osteoblasts. a, b, Distribution of 
vasculature (CD31, red; a) and N-cadherin⁺ cells 
(N-cad, green; b) in bone (DIC, grey) and bone 
marrow (DAPI, blue). c–f, Representative images 
of endosteal surface covered by CD31 only 
(c), N-cadherin only (d), and both (e) in the 
trabecular bone (TB) area and compact bone 
(CB) area (f). g, h, Percentage coverage by CD31 
and N-cadherin (g), and quantification of the 

Using EVISC, we were also able to observe HSC proliferation 
directly. We performed this experiment at different time points. At 5–8 h 
after transplantation (Fig. 3e), a GFP⁺ HSC located at the deepest 
part of a recess in bone gave rise to two daughter cells (Fig. 3f–i and 
Supplementary Movie 6). In another case, we observed two homed 
HSCs in the same field of view with different behaviours over 15-h 
real-time imaging: one was stable and the other was actively mobile 
and dividing (Fig. 3j, k and Supplementary Movie 8). At 36–40 h after 
transplantation (Supplementary Movie 7), three out of four GFP⁺ 
HSCs in the same area were stable at the endosteal region. We 
recorded active division of homed GFP⁺ HSCs in three out of 
twenty-one cases (14.2%) in three independent experiments. Our 
immunoassays identified the endosteal zone of irradiated recipients 
as a site of increased HSC proliferation after transplantation. Our 


©2009 Macmillan Publishers Limited. All rights reserved 


[Figure 3, panels c and d: bar charts comparing non-irradiated and irradiated 
recipients; legend: Ki67⁺ GFP-HSCs, Ki67⁻ GFP-HSCs; annotated P = 0.02 and P = 0.012.] 

NATURE | Vol 457 | 1 January 2009 


Figure 3 | Real-time imaging and 
immunoassaying of homed GFP⁺ HSCs and 
measurement of HSC proliferation and 
expansion. a, b, Co-staining GFP⁺ HSCs with 
Ki67. c, d, Percentage (c) and distribution (d) of 
proliferating GFP⁺ HSCs. (See Supplementary 
Tables 6 and 7 for statistics.) e, Real-time imaging 
of homed GFP⁺ HSCs. Bone structure (DIC). 
f–i, Division of GFP⁺ HSCs. j, k, Inactive (yellow 


[Figure 3, remaining panels: axis labels Endosteum and Central marrow; flow 
cytometry plots (Sca-1, GFP) two weeks after transplantation for non-irradiated 
and irradiated recipients.] 


ability to detect division of homed HSCs by EVISC correlates well 
with this finding. The key advantage of EVISC is that we can monitor 
the diverse dynamic behaviour of individual HSCs over many hours 
of real-time imaging versus a snap-shot of a static picture. 

To confirm the observed expansion of homed HSCs in irradiated 
mice, we measured the HSC number using flow cytometry assay 2 weeks 
after transplantation of 2,000 GFP⁺ HSCs (Flk2⁻ LSK; Fig. 3l–r). First 
we found that HSCs were not stable in non-irradiated bone marrow, as 
GFP⁺ HSCs were rarely detected (Fig. 3m, n). In contrast, HSCs 
transplanted into irradiated mice underwent rapid expansion (Fig. 3o, p). 
There were 1,959 GFP⁺ Flk2⁻ LSK cells in all the legs (two femurs plus 
tibias, 15% of the whole body) determined by flow assay (Fig. 3q), and 
this equates to 13,065 (1,959 ÷ 15%) GFP⁺ Flk2⁻ LSK cells in the whole 
mouse body (Supplementary Table 8). Interestingly, we also found a 
higher number of expanded HSCs in the trabecular bone area than in 
the compact bone area (Fig. 3r), separated as indicated in Fig. 3l. 

Here we developed an ex vivo system to image the homing and 
proliferation of HSCs in real time, facilitating detection of functional 
niches and monitoring the behaviour of stem cells. We showed that 
the endosteal region forms a special zone to maintain HSCs 
normally but can support their expansion in response to bone 
marrow damage. We clarified that HSCs tend to (but do not always) 
home to and directly interact with N-cadherin⁺ pre-osteoblastic cells 
(Fig. 4a). Osteoblastic-cell-generated signals normally inhibit cell 
proliferation; however, vascular signals can also be a major 
component of the endosteal zone (Fig. 4a). Bone marrow damage 
results in a changed microenvironment that favours HSC transient 
expansion. We speculate that this might result from an enhanced 
vascular signal and potentially attenuated inhibitory signals from 
osteoblastic cells (Fig. 4b). Moreover, signals from osteoblastic cells 
can directly promote HSC proliferation in the context of 
mobilization (Fig. 4b). In contrast, the central marrow may form a different 
zone where dominant vascular signals favour cell proliferation, as has 
been proposed and is further supported by the transient 
proliferation of HSCs in the vasculature-dominant spleen in response to bone 
marrow damage (Figs 1j and 4b). Although we focused on 
investigating osteoblastic and vascular components, other cell types 
(osteoclasts, reticular cells, neuronal cells, haematopoietic 
progenitors) within the immediate environment can also influence 
the behaviour and states of HSCs. The EVISC method allows us to 
study the behaviour of individual HSCs and their dynamic 
interactions with niches in an ex vivo setting, and to study stem cell fate 
determination in combination with different types of animal models 
in which HSC marking and lineage tracing are available. 

arrow) and active (white arrow) GFP⁺ HSCs 
during 15-h real-time imaging. Blue indicates 
bone (dotted line). l, Haematoxylin and eosin 
staining shows separation of trabecular bone area 
(TBA) and compact bone area (CBA). 
(Flk2⁻ LSK). q, Absolute number of GFP⁺ HSC 
trabecular bone area and compact bone area 

Figure 4 | Illustration of endosteal zone and central marrow zone. a, The 
endosteal zone is the inner bone surface and is under homeostasis. HSCs 
residing in this zone normally receive inhibitory signals from osteoblastic 
cells but are also exposed to vascular signals. In the central marrow zone, 
vascular signals might be dominant. b, When bone marrow, including HSCs, 
is damaged by irradiation, the endosteal environment transiently converts 
into a stimulatory environment. This might be due to both a reduction in 
osteoblastic inhibitory signals and an increase in vascular stimulatory signals. 


METHODS SUMMARY 

Mice were handled according to Institutional and NIH guidelines. Ex vivo imaging 
was achieved using a confocal laser scanning microscope LSM 510 META 
equipped with a pulsed NIR laser Chameleon Ultra for two-photon excitation. 
Fluorescence intensity measurement was processed using the Imaris 6.0.0 software 
(Bitplane) to facilitate three-dimensional rendering and fluorescence 
quantification. For immunofluorescent assay, anti-N-cadherin (1:50, YS, IBL), anti-CD31 
(1:100, BD Pharmingen), anti-GFP (1:200, Novus Biologicals), anti-Osx (1:200, 
Abcam) and anti-Ki67 (1:25, Dako) were used. X-gal and TRAP staining followed 
our previous procedure. To measure the coverage percentage by CD31 
vasculature and N-cadherin⁺ osteoblasts, for each image, a line was drawn along the 
bone surface; we then ran a custom java script that encompassed the area within 
16 µm of the bone surface, generating a straightened image and creating a line 
profile from that straightened image. Flow cytometry analysis of haematopoietic 
cells was described previously. To compare HSCs in trabecular bone area and 
compact bone area, femurs and tibias were separated before collecting bone 
marrow. The Scl-TVA mice were injected intravenously with 150 mg kg⁻¹ of 
5-fluorouracil 4 days before injection of RCAS-Luc virus. Bioluminescence imaging of 
live mice was done using a Xenogen IVIS-200 imager. The live imaging could only 
detect strong bioluminescence signals, which might be attenuated by thick bone, 
muscle and fur. For real-time PCR we followed our reported procedure. 

Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS 

Ex vivo imaging of stem cells. We isolated the femurs and tibias from the 
transplanted mice and cut the bone in the vertical orientation (2–3 mm in height, 
Fig. 1a). The isolated bone (especially trabecular bone) was embedded vertically 
in 0.5% agarose in DMEM medium (phenol-red free) in a 3.5-cm dish facing the 
coverslip bottom, allowing the laser beam to penetrate into the bone marrow 
from the open end of bone/bone marrow (Fig. 1a). All live cell images were 
acquired using a confocal laser scanning microscope, LSM 510 META (Carl 
Zeiss), equipped with a pulsed NIR laser Chameleon Ultra (Coherent) for 
two-photon excitation. Before imaging live cells, we established 920 nm to be 
the optimal wavelength for two-photon excitation of GFP. We used a 32-channel 
META detector for spectral analysis and positive identification of GFP-labelled 
cells. After positive identification, we imaged the cells using an internal PMT 
detector with a band-pass filter of 500-550 nm, well separating the specific 
fluorescence signal from autofluorescence and second harmonic signals. The 
pinhole was set to its most open position. We used the Carl Zeiss objectives 
Plan Apochromat with a magnification of ×20 or C-Achroplan NIR with a 
magnification of ×40 with a numerical aperture of 0.8. These objectives are 
optimized for two-photon NIR excitation and a coverslip thickness of 
0.170 mm. The images shown in Figs 1 and 3 were not processed except for 
cropping, 3 × 3 median filter, and linear contrast adjustment. Caution needed 
to be exercised when a particular area was constantly exposed to the laser beam. 
The targeted cells might be damaged, resulting in the appearance of some large 
cells (~10–20 µm) converted from GFP⁻ into GFP⁺ cells by actively fusing with 
GFP⁺ donor cells, indicating that they were probably macrophages (not shown). 
The 3.5-cm glass bottom culture dishes were from MatTek Corporation. 
Probability to detect HSCs in single femur. An average adult mouse weighs 
20 g. Assuming an average density of the mouse body of 1 g per 1 cm³ (equal to 
the density of water) we estimate the volume of a mouse body to be 20 × 10³ 
mm³. A femur is about 10 mm long and has a radius of 0.5 mm, resulting in a 
volume of 8 mm³. Thus, the volume of a femur is 4 × 10⁻⁴ times the volume of a 
mouse body. If we inject 50,000 HSCs and assume even distribution throughout 
the whole mouse body, we would expect to find 20 cells per femur. 
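
The estimate above can be reproduced with a few lines of arithmetic. This sketch assumes the femur is modelled as a cylinder (volume πr²L), which is consistent with the quoted 8 mm³ for the stated length and radius; the function and parameter names are illustrative.

```python
from math import pi


def expected_cells_per_femur(
    injected_cells,
    body_mass_g=20.0,
    density_g_per_cm3=1.0,
    femur_length_mm=10.0,
    femur_radius_mm=0.5,
):
    """Expected number of injected cells in one femur under even distribution."""
    # 1 cm^3 = 1,000 mm^3, so a 20 g mouse at 1 g per cm^3 is 20 x 10^3 mm^3.
    body_volume_mm3 = body_mass_g / density_g_per_cm3 * 1000.0
    # Cylinder approximation of the femur (an assumption of this sketch).
    femur_volume_mm3 = pi * femur_radius_mm ** 2 * femur_length_mm
    fraction = femur_volume_mm3 / body_volume_mm3
    return injected_cells * fraction


# About 19.6 cells, which the text rounds to 20.
print(expected_cells_per_femur(50_000))
```

The unrounded femur volume is about 7.85 mm³ and the volume fraction about 3.9 × 10⁻⁴, so the rounded figures in the text (8 mm³, 4 × 10⁻⁴, 20 cells) follow directly.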
Immunohistochemical and immunofluorescent assay. The femurs and tibias 
were fixed in Zn²⁺-formalin and processed for paraffin sections. For 
immunofluorescent staining, after antigen retrieval using EZ Retriever Microwave 
(BioGenex), nonspecific antibody binding was blocked by incubating slides with 
1× universal blocking reagent (BioGenex, HK085-5k) at room temperature for 
1 h. Anti-N-cadherin (1:50, YS, IBL), anti-CD31 (1:100 dilution, BD 
Pharmingen), anti-GFP (1:200, Novus Biologicals), anti-Osx (1:200, Abcam), 
or anti-Ki67 (1:25, Dako) was added to the tissue section slides. YS polyclonal 
antibody was used for immunoassaying because the monoclonal antibody 
MNCD2 was not efficient. However, we cannot exclude the possibility that YS 
antibody may recognize other cadherins in stromal cells, as MNCD2 is not 
efficient to block the staining of the non-LSK cells out of lineage-negative bone 
marrow. The slides were incubated at 4 °C overnight. Donkey anti-rabbit 546/ 
568/488 or anti-rat 546/568 or donkey anti-goat 488 Alexa Fluor conjugate 
(1:200, Invitrogen) was added at room temperature for 1 h. For CD31 staining, 
we used the TSA Biotin System (PerkinElmer, NEL700). DAPI was used to stain 
the nucleus and the slides were then observed under a fluorescent microscope. 
X-gal and TRAP staining has been described previously. 

Fluorescence intensity measurement. The data sets were processed using Imaris 
6.0.0 software (Bitplane) to facilitate three-dimensional rendering and fluor- 
escence quantification. The Surpass Scene in Imaris was used to create a contour 
surface object that encapsulated cells of interest based on DAPI fluorescence 
signal. The mean fluorescence intensity of the channel of interest in each contour 
surface was recorded. A manual background subtraction was then performed by 
measuring the mean red intensity of a uniform background area and subtracting 
the background mean intensity value from each of the cell intensity mean values. 
Analysis of CD31 and N-cadherin coverage of the endosteum. Expected 
‘double positive’ coverage of the endosteum by both CD31 and N-cadherin, 
assuming they are independently distributed, was calculated for each line profile 
as the product p × q, where p and q are the total fractions of the line profile covered 
by CD31 and N-cadherin, respectively. The observed and expected double pos- 
itive rates were compared using two-tailed paired t-tests. 
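
As a rough sketch of this comparison, the expected double-positive fraction for each line profile is p × q, and the paired t statistic is computed on the per-profile differences between observed and expected values. The profile values below are hypothetical, and the function names are illustrative only. The two-tailed P value would then be read from a t distribution with n − 1 degrees of freedom (for example with `scipy.stats.ttest_rel`, if SciPy is available).

```python
from math import sqrt
from statistics import mean, stdev


def expected_double_positive(p, q):
    """Expected overlap if CD31 and N-cadherin are independently distributed."""
    return p * q


def paired_t_statistic(observed, expected):
    """Return the paired t statistic and its degrees of freedom."""
    diffs = [o - e for o, e in zip(observed, expected)]
    n = len(diffs)
    t = mean(diffs) / (stdev(diffs) / sqrt(n))
    return t, n - 1


# Hypothetical line profiles: (p, q, observed double-positive fraction).
profiles = [(0.5, 0.4, 0.30), (0.6, 0.5, 0.42), (0.2, 0.5, 0.16)]
expected = [expected_double_positive(p, q) for p, q, _ in profiles]
observed = [obs for _, _, obs in profiles]
t_value, dof = paired_t_statistic(observed, expected)
```
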

Flow cytometry analysis of HSCs. Bone marrow cells were flushed from the 
femurs and tibias and the red blood cells were lysed using a 0.15 M ammonium 
chloride solution. HSCs were enriched using FACS as lineage PE-Cy5-negative 
(the lineage cocktail contained CD3, CD4, CD5, CD8, Gr1, CD11b, B220, IgM 
and Ter119), Sca-1 PE-positive, c-Kit APC-positive, Biotin-Flk2 streptavidin 
PE-Cy7-negative. For transplantation experiments, donor HSCs were isolated 
from GFP transgenic mice (Jackson lab). Rescue bone marrow cells (2 X 10°) 
were from Ptpre. 

To compare HSCs in trabecular bone area and compact bone area, two ends of 
femurs and one end of tibias were cut as trabecular bone area (2-3 mm away 
from the end); the other end of tibia (3—4 mm away from the end) was cut and 
discarded; and the middle part of femurs and tibias was combined as compact 
bone area. Bone marrow cells were flushed from trabecular bone area and compact 
bone area separately and then analysed following the regular HSC analysis protocol. 
Real-time RT-PCR. We followed the procedure reported previously. Primers 
used for amplification of Sdf1 were as follows: forward, 
5′-GGACGCCAAGGTCGTCGCCGT-3′; reverse, 5′-TAATTTCGGGTCAATGCACA-3′. HPRT was 
used as an internal control for normalization of input mRNA. 
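
Normalization to an internal control is often done with the comparative Ct (2^−ΔΔCt) calculation. The text does not state which calculation was used, so the sketch below is an assumption: it treats HPRT as the reference gene, assumes roughly 100% amplification efficiency, and uses hypothetical Ct values.

```python
def relative_expression(ct_target, ct_reference, ct_target_ctrl, ct_reference_ctrl):
    """Fold change of a target gene relative to a control sample,
    normalized to a reference gene (comparative Ct method)."""
    delta_sample = ct_target - ct_reference
    delta_control = ct_target_ctrl - ct_reference_ctrl
    return 2.0 ** -(delta_sample - delta_control)


# Hypothetical Ct values: target gene and HPRT in a sample and in a control.
fold = relative_expression(24.0, 20.0, 26.0, 20.0)
```

With these values the target crosses threshold two cycles earlier in the sample than in the control after normalization, which corresponds to a fourfold increase.
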
Carcinoma-produced factors activate myeloid cells 
through TLR2 to stimulate metastasis 


Sunhwa Kim, Hiroyuki Takahashi, Wan-Wan Lin, Pascal Descargues, Sergei Grivennikov, Youngjun Kim, 
Jun-Li Luo & Michael Karin 


Metastatic progression depends on genetic alterations intrinsic to 
cancer cells as well as the inflammatory microenvironment of 
advanced tumours. To understand how cancer cells affect the 
inflammatory microenvironment, we conducted a biochemical 
screen for macrophage-activating factors secreted by metastatic 
carcinomas. Here we show that, among the cell lines screened, 
Lewis lung carcinoma (LLC) cells were the most potent macrophage 
activators, leading to production of interleukin-6 (IL-6) and 
tumour-necrosis factor-α (TNF-α) through activation of the Toll-like 
receptor (TLR) family members TLR2 and TLR6. Both TNF-α 
and TLR2 were found to be required for LLC metastasis. 
Biochemical purification of LLC-conditioned medium (LCM) led 
to identification of the extracellular matrix proteoglycan versican, 
which is upregulated in many human tumours including lung cancer, 
as a macrophage activator that acts through TLR2 and its 
co-receptors TLR6 and CD14. By activating TLR2:TLR6 complexes and 
inducing TNF-α secretion by myeloid cells, versican strongly 
enhances LLC metastatic growth. These results explain how 
advanced cancer cells usurp components of the host innate immune 
system, including bone-marrow-derived myeloid progenitors, to 
generate an inflammatory microenvironment hospitable for metastatic 
growth. 

Distant-site metastases are the leading cause of cancer-associated 
mortality. They depend on genetic and/or epigenetic alterations that 
are intrinsic to cancer cells, or extrinsic factors provided by the 
tumour microenvironment. For instance, cytokines produced by 
inflammatory cells can enhance metastatogenesis by repressing the 
metastasis suppressor maspin within primary prostate carcinoma 
cells. Furthermore, tumour progression and metastasis positively 
correlate with the presence of infiltrates containing myeloid and 
lymphoid cells. It has been shown that certain carcinomas secrete 
factors that upregulate fibronectin and recruit vascular endothelial 
growth factor receptor 1 (VEGFR1)-positive haematopoietic progenitors 
to sites of future metastatic growth, termed the pre-metastatic 
niche. To examine whether cancer cells secrete factors that directly 
activate myeloid cells to produce tumour-promoting cytokines, we 
collected serum-free conditioned medium from different cancer cell 
lines, derived mainly from C57BL6 mice, and applied it to 
bone-marrow-derived macrophages (BMDM), which were assayed for 
production of interleukin-1β (IL-1β), IL-6 and TNF-α. The screen 
included 1C1C7 and TrampC1, which are liver and prostate cancer 
cell lines, respectively, with little or no metastatic activity, and two 
metastatic breast and lung carcinomas, 4T1 and LLC, respectively. 
Conditioned medium from metastatic cells, especially LLC, induced 
higher amounts of IL-6 and TNF-α secretion than conditioned 
medium from non-metastatic cells (Fig. 1a). IL-1β secretion was 
undetectable and the conditioned medium did not contain IL-6 or 
TNF-α (data not shown). LCM also induced expression of Il1β, Il6 
and Tnfα messenger RNAs (mRNAs), whereas serum-free medium 
(SFM) and NIH3T3-conditioned medium were inactive (Fig. 1b and 
data not shown). We investigated the metastatogenic function of some 
of the LCM-induced cytokines by tail vein injection of LLC into age- 
and sex-matched Tnfα and Il6 knockout mice and wild-type (WT) 
controls. Tnfα−/− mice exhibited markedly (P < 0.001) reduced 




Figure 1 | Metastatic carcinomas secrete factors that induce macrophage 
production of TNF-α, needed for lung metastasis. a, BMDM were cultured 
with SFM or SFM conditioned by mouse carcinoma cells (conditioned 
medium), and cytokine production was measured (averages ± s.d., n = 3). 
b, BMDM were cultured with SFM or LCM, and cytokine mRNAs were 
measured by quantitative PCR with reverse transcription. Fold-induction 
above SFM-treated cells was determined (averages ± s.d., n = 3; *, P < 0.001 
by Student’s t-test). c, Survival of WT (n = 22) and Tnfα−/− (n = 15) mice 
inoculated with LLC (1 × 10° cells) through the tail vein (P < 0.001; log-rank 
test for significance). d, Lungs of WT and Tnfα−/− mice 47 days after LLC 
inoculation (2 × 10° cells). Tumour multiplicities are shown underneath 
(averages ± s.d., n = 11, P < 0.001 by Student’s t-test). 
mortality compared with WT mice after inoculation with 1 × 10° LLC 
cells (Fig. 1c) and showed an even greater survival advantage when 
given a smaller LLC inoculum (Supplementary Fig. 1a). Similar 
differences were seen in lung tumour multiplicity (Fig. 1d). By 
contrast, there was little difference in survival of Il6−/− and WT mice 
inoculated with 1 × 10° LLC cells (Supplementary Fig. 1b). Thus, 
TNF-α but not IL-6 is important for LLC metastasis. 
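
The survival comparisons here are reported with a log-rank test. As an illustration of what that statistic computes, the sketch below implements the standard two-group log-rank chi-square. The function name and the toy follow-up times are hypothetical and are not data from this study. An event flag of 1 marks a death and 0 marks a censored animal.

```python
def logrank_chi2(times_a, events_a, times_b, events_b):
    """Two-group log-rank chi-square statistic (1 degree of freedom).

    Requires at least one event time at which both groups still
    have animals at risk; otherwise the variance is zero."""
    data = [(t, e, 0) for t, e in zip(times_a, events_a)]
    data += [(t, e, 1) for t, e in zip(times_b, events_b)]
    event_times = sorted({t for t, e, _ in data if e})

    obs_minus_exp = 0.0
    variance = 0.0
    for t in event_times:
        n = sum(1 for tt, _, _ in data if tt >= t)               # at risk, both groups
        n_a = sum(1 for tt, _, g in data if tt >= t and g == 0)  # at risk, group A
        d = sum(1 for tt, e, _ in data if tt == t and e)         # deaths at time t
        d_a = sum(1 for tt, e, g in data if tt == t and e and g == 0)
        obs_minus_exp += d_a - d * n_a / n
        if n > 1:
            variance += d * (n_a / n) * (1 - n_a / n) * (n - d) / (n - 1)
    return obs_minus_exp ** 2 / variance


# Toy example: group A dies on days 1 and 2, group B on days 3 and 4.
chi2 = logrank_chi2([1, 2], [1, 1], [3, 4], [1, 1])
```

The P value is obtained by comparing the statistic with a chi-square distribution with one degree of freedom.
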

We explored the involvement of TLR family members in sensing 
LCM components. BMDM from mice deficient in TLR2, TLR3, 
TLR4 or TLR9 or their adaptor proteins, Myd88 and TRIF (which 
is inactivated by the Lps2 mutation: Trif^m), were examined for 
production of IL-6, a convenient BMDM activation marker. 
LCM-induced IL-6 was fully dependent on TLR2 and Myd88 but not on 
TLR3, TLR4, TLR9 or TRIF (Fig. 2a). Tlr2−/− BMDM were also 
defective in LCM-induced Il1β and Il6 mRNA expression and 
LCM did not induce anti-tumorigenic type I interferon genes 
Figure 2 | LLC-secreted factor activates TLR2 to induce lung inflammation. 
a, BMDM from indicated mouse strains were cultured with SFM or LCM, 
and IL-6 production was measured (m: mutant allele; averages ± s.d., n = 3, 
presented as percentage of WT LCM-stimulated value). b, BMDM were 
treated with LCM or LPS (100 ng ml−1). Cell lysates were examined for 
kinase phosphorylation (P) and IκBα degradation by immunoblotting. 
Total ERK and HSP90 are loading controls. c, BMDM from indicated mouse 
strains were cultured with LCM or Pam3CSK4 (1 ng ml−1), and IL-6 
production was measured (averages ± s.d., n = 4). d, RNA was extracted 
from lungs of WT or Tlr2−/− mice at indicated times after LLC inoculation 
(2 × 10° cells). mRNAs were quantified as above and the amounts in 
non-inoculated WT or Tlr2−/− lungs were given a value of 1.0 (averages ± s.e.m., 
n = 3; *P < 0.05, **P < 0.005 (compared with WT) by Student’s t-test). 
(Supplementary Fig. 2a), which are readily induced after TLR3 or 
TLR4 engagement. TLR2 was also required for LCM-induced IL-6 
and TNF-α secretion by alveolar macrophages, which produced 
tenfold more TNF-α than BMDMs (Supplementary Fig. 2b). TLR2 
was required for optimal LCM-induced activation of mitogen-activated 
protein kinases and IκB kinase or IκBα degradation 
(Fig. 2b). TLR2 uses TLR1, TLR6 or CD14 as co-receptors. 
LCM-induced IL-6 production was dependent on TLR2, TLR6 and CD14 
but not on TLR1 (Fig. 2c). By contrast, the response to Pam3CSK4, a 
bacterial lipoprotein analogue, depended on TLR2 and TLR1 but 
not on TLR6 and CD14. These results rule out possible contamination 
with bacterial lipoproteins. Furthermore, anti-mycoplasma 
treatment of LLC had no effect on LCM activity (data not shown). 

To examine the in vivo role of TLR2, we inoculated sex- and age- 
matched Tlr2−/− and WT mice with 2 × 10° LLC cells through the 
tail vein and measured mRNAs encoding cytokines and chemokines 
in their lungs. LLC-induced lung inflammation has been previously 
described, but its mechanism was unknown. Tnfα, Il1β, Il6 and 
inflammatory chemokine mRNAs were induced 5 days after LLC 
inoculation in WT lungs, peaking at 9 days after inoculation 
(Fig. 2d). None of these mRNAs was induced in lungs of Tlr2−/− 
mice, whose basal content of Mip1β/Ccl4, Mip2/Cxcl2 and Kc/Cxcl1 
mRNAs was higher than in WT lungs (Supplementary Fig. 3). In addition, 
Tlr2−/− and WT mice were subcutaneously inoculated with 
LLC cells and examined for lung macrophage infiltration and inflammatory 
cytokine gene expression days later. Although no difference 
was observed in primary subcutaneous tumour growth, macrophage 
infiltration and inflammatory cytokine gene expression were greatly 
reduced in Tlr2−/− mice compared with WT mice (Supplementary 
Fig. 4a–c). 

To investigate whether TLR2 signalling contributes to LLC metastatogenesis, 
we inoculated age- and sex-matched Tlr2−/− and WT 
mice with DsRed-labelled LLC cells through the tail vein and examined 
their lungs for micrometastases. WT but not Tlr2−/− lungs showed 
small clusters of DsRed-LLC cells with adjacent CD11b+ and CD11c+ 
myeloid cells (Fig. 3a). We also detected a few cells that carried the 
CD3 antigen (T cells) in micrometastases of WT lungs (data not 
shown). Importantly, Tlr2−/− mice exhibited significantly greater 
(P < 0.02) survival than WT mice after LLC inoculation and their 
lungs contained fewer and smaller tumour nodules (Fig. 3b, c). 
Tumour nodules in WT mice contained more CD11b+/Gr1+ inflammatory 
monocytes/myeloid suppressors and IL-10high/F4/80+ M2 
macrophages (Supplementary Fig. 5). Tlr2−/− mice exhibited significantly 
fewer lung and liver tumour nodules than WT mice and a lower 
incidence of adrenal gland metastasis after subcutaneous implantation 
of LLC cells (Fig. 3d). To investigate whether TLR2 acts in 
bone-marrow-derived cells, we examined survival of LLC-inoculated 
chimaeric mice. Mice reconstituted with Tlr2−/− bone marrow 
(WT/Tlr2−/−) exhibited markedly improved (P < 0.04) survival 
compared with mice reconstituted with WT bone marrow (WT/WT) 
(Fig. 3e). WT/WT and WT/Tlr2−/− mice were also inoculated 
with 2 × 10° LLC cells followed by intraperitoneal injections of LCM 
or SFM. Lung and liver tumour loads were significantly higher 
(P < 0.05) in WT/WT mice receiving LCM than in those receiving 
SFM along with the LLC inoculum (Fig. 3f). The pro-metastatic effect 
of LCM was dependent on TLR2 activation, as little or no metastatic 
enhancement was seen in WT/Tlr2−/− mice. These results strongly 
suggest that LCM contains TLR2-activating factors that enhance 
metastatogenesis. 

To identify the nature of these factors, we collected large amounts 
of LCM, separated it on a mono-Q anion exchange column and 
monitored column fractions for their ability to induce IL-6 in 
BMDM. Fractions with IL-6-inducing activity were pooled and separated 
on a Superdex 200 sizing column (Supplementary Fig. 6). Most 
of the IL-6-inducing activity eluted in a few high molecular mass 
(greater than 400 kDa) fractions that contained several polypeptides 
larger than 200 kDa. These fractions were pooled, deglycosylated and 
Figure 3 | TLR2 is required for metastatic growth. a, WT and Tlr2−/− lungs 
were analysed 9 days after inoculation of DsRed-LLC (2 × 10° cells) for 
DsRed and myeloid cell markers (CD11b, CD11c) using fluorescence 
microscopy (magnification ×200). b, Survival of LLC-inoculated (2 × 10° 
cells) WT (n = 8) and Tlr2−/− (n = 10) mice (P < 0.02; log-rank test for 
significance). c, Lungs and haematoxylin-and-eosin-stained lung sections 
(magnification ×25) 20 days after LLC inoculation. Tumour multiplicity is 
shown on the left (averages ± s.e.m., n = 8, P < 0.001 by Student’s t-test). 
d, Tumour multiplicities of lung and liver metastatic nodules and incidence 


subjected to mass spectrometry, resulting in identification of several 
peptides derived from the extracellular matrix proteins versican V1, 
laminin-β1, thrombospondin 2 and procollagen type III-α1 (Fig. 4a). 
To examine which protein accounts for induction of inflammatory 
cytokines, we incubated LCM with individual neutralizing antibodies 
before BMDM stimulation and measurement of cytokine production. 
Incubation of LCM with antibodies to versican, laminin-β1 or 
procollagen type III-α1, but not thrombospondin 2, reduced IL-6 
and TNF-α production (Supplementary Fig. 7). A control antibody 
to HMGB1, an inflammatory mediator released by necrotic cells, 
did not inhibit cytokine induction. 

To investigate the role of these proteins in LCM-enhanced metastatogenesis, 
we generated stable LLC cell lines containing short hairpin 
RNAs (shRNAs) specific to versican V1, laminin-β1 or procollagen 
III-α1. Silencing efficiency was approximately 90% or higher 
(Fig. 4b). The silenced cells and LLC cells transduced with a control 
shRNA were injected into mice, and lung tumour nodules and 
survival were monitored. Silencing of versican V1 significantly reduced 
(P < 0.001) tumour multiplicity, but silencing of laminin-β1 resulted 
only in a modest inhibition, whereas procollagen III-α1 silencing 
slightly enhanced tumour multiplicity (Fig. 4c). Furthermore, silencing 
of versican V1 reduced lung nodule multiplicity fourfold 
(Supplementary Fig. 8a). Tumours isolated from mice inoculated with 
versican-silenced LLC cells displayed very low versican expression 
of adrenal metastasis 17 days after primary tumour removal 
(averages ± s.e.m., WT n = 9, Tlr2−/− n = 6; *P < 0.05 by Student’s t-test). 
e, Survival of WT/WT or WT/Tlr2−/− chimaeric mice inoculated with LLC 
(2 × 10° cells) 6–7 weeks after bone-marrow reconstitution (P < 0.04; log- 
rank test for significance; n = 6). f, Lung tumour multiplicity (left), size 
(middle) and liver tumour multiplicity (right) in chimaeric mice, 27 (lung) 
or 48 (liver) days after LLC injection (2 × 10° cells) together with SFM or 
LCM (averages ± s.e.m., n = 8; *P < 0.05 by Student’s t-test). 


(Supplementary Fig. 8b), whereas lung tumours from LLC-inoculated 
mice expressed more versican than normal lung (Supplementary Fig. 
8c). Importantly, mice inoculated with versican-silenced LLC cells 
exhibited significantly improved (P < 0.05) survival (Fig. 4c). 
Silencing of versican also reduced metastatic spread to lung, liver and 
adrenal glands in the subcutaneous implantation model (Fig. 4d). To 
ascertain the proinflammatory and premetastatic functions of versican, 
we used the low-metastatic LLC variant P29-LLC. Conditioned medium 
from P29-LLC did not induce IL-6 in BMDM and contained very 
little versican (Supplementary Fig. 9). Ectopic expression of human 
versican in P29-LLC cells increased lung tumour multiplicity after tail 
vein injection (Fig. 4e). 

To examine how versican activates macrophages, we produced 
His-tagged human versican V1 in LLC cells and purified it on an 
Ni-chelate column. The purified protein induced IL-6 and TNF-α 
production in WT but not Tlr2−/− BMDM (Fig. 4f). We investigated 
whether versican interacts with TLR2 or its CD14 co-receptor. 
Immunoprecipitation of lysates of LCM-incubated Raw264.7 macrophages 
with versican-specific antibody, but not a control antibody, 
co-precipitated TLR2 and CD14 but not TLR4 (Fig. 4g). The versican 
antibody did not precipitate TLR2 unless the macrophages were first 
incubated with LCM. 

Metastasis is the result of a complex process involving invasion of 
adjacent tissues, intravasation, circulatory transport, arrest at a 
distant site, extravasation, growth, survival and neoangiogenesis. 
Bone-marrow-derived cells, such as macrophages and haematopoietic 
progenitors, are important participants in this process; however, how 
they are mobilized and activated to support metastasis is unclear. Our 
results indicate that versican secretion by LLC cells is necessary for 
metastatic spread to lung, liver and adrenal gland, a process that 
Figure 4 | Versican is a TLR2 agonist and metastasis-enhancing factor. 
a, Identification of candidate LCM macrophage activators by mass 
spectrometry. Probability-based Mowse scores (upper) and tryptic peptides 
corresponding to the indicated proteins (lower). b, LLC were transduced 
with indicated shRNA lentiviruses. After selection, expression of the 
indicated proteins was analysed. c, Left: LLC transduced with indicated 
lentiviruses were injected (2 × 10° cells) into mice (n = 8). Lung tumour 
multiplicity was enumerated at 20 days (averages ± s.e.m., P < 0.001 by 
Student’s t-test). Right: survival of mice inoculated with LLC transduced 
with control or Vers shRNAs (n = 9; *P < 0.05; log-rank test for 
significance). d, Lung and liver tumour multiplicities and incidence of 
adrenal metastasis 17 days after primary tumour removal 
(averages ± s.e.m., n = 8 and n = 5 for mice injected with control or Vers 
shRNA-transduced cells, respectively; *P < 0.05 by Student’s t-test). 
e, Upper: non-transfected and human versican (hVers)-transfected P29-LLC 
cells were analysed for versican expression. Lower: hVers non-expressing 
and expressing P29-LLC cells were injected through the tail vein (n = 7). 
After 27 days, lung metastatic nodules were enumerated (*P < 0.05 by 
Student’s t-test). f, Left: silver staining of purified 6×His-hVers. Right: 
BMDM were incubated without or with His-hVers for 20–24 h, and cytokine 
secretion was measured (averages ± s.d., n = 3). g, Lysates of Raw264.7 cells 
incubated with LCM or SFM for 1 h were immunoprecipitated with 
versican-specific or control antibody and analysed by immunoblotting with 
the indicated antibodies. 


depends on TLR2-mediated myeloid cell activation and TNF-α 
production. Versican is an aggregating chondroitin sulphate proteoglycan 
that accumulates both in tumour stroma and cancer cells. It 
can bind hyaluronan, and both versican and hyaluronan are highly 
expressed in non-small-cell lung cancer (NSCLC), especially in 
advanced disease with high recurrence rate, whereas versican expression in 
normal lung is rather low. Versican or fragments thereof enhance tumour cell 
migration, growth and angiogenesis, processes that are of direct 
relevance to metastasis. Versican also binds to several adhesion molecules 
expressed by inflammatory cells and has pro-inflammatory 
activity. A related extracellular matrix proteoglycan, biglycan, has 
been reported to activate both TLR2 and TLR4, but our results indicate 
that the pro-inflammatory activities of versican rely on TLR2 but 
not TLR4. TLR2 recognizes Gram-positive bacteria-derived lipoteichoic 
acid and lipoproteins. This activity mainly depends on 
TLR2:TLR1 dimers, but the response to versican requires TLR6 
and not TLR1 as a co-receptor. Although TLR2 and versican interact, 
it is not clear whether this interaction is direct or depends on a versican 
ligand, such as hyaluronan. Indeed, hyaluronan fragments can activate 
macrophages through TLR2 (ref. 21), and hyaluronan accumulation 
and the enzyme that converts large hyaluronan polymers to smaller 
fragments, hyaluronidase, have been linked to metastasis. 

TLR2 on host myeloid cells and their product TNF-α are important 
positive modulators of LLC metastatic behaviour, although neither 
protein influences primary tumour growth of subcutaneously 
implanted LLC. It appears that TNF-α is one of the major pro-metastatic 
factors produced by host myeloid cells. TNF-α can suppress 
the apoptosis of cancer cells and stimulate their proliferation through 
NF-κB activation. In addition, by increasing vascular permeability, 
TNF-α can enhance recruitment of leukocytes as well as intravasation 
and extravasation of cancer cells. We suggest that versican, its interaction 
with TLR2 and production of TNF-α by activated myeloid cells 
provide potential points for anti-metastatic intervention. 


METHODS SUMMARY 


Detailed methods are given in Supplementary Information. Briefly, LLC cells were 
injected through the tail vein or subcutaneously implanted into 6- to 7-week-old 
mice at 2 X 10° to 2 X 10° cells per mouse to measure metastases to lung, liver or 
adrenal gland. Metastasis-enhancing factors were purified from LCM by column 
chromatography, and identified on a QSTAR XL qQTOF mass spectrometer. 
Factor activity was determined by the ability to induce IL-6 production by 
BMDM. Gene and protein expression were monitored by quantitative PCR and 
immunoblot analysis, respectively. Tumours and their composition were 
analysed by immunohistochemistry and indirect immunofluorescence. 
Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 

Animal and cell culture. Tlr1−/−, Tlr2−/−, Tlr3−/−, Tlr4−/−, Tlr9−/−, 
Myd88−/− and Trif^m, Tlr6^m and Cd14^m (other names: Lps2, Insouciant and 
Heedless, respectively) mutant mice were provided by S. Akira (Osaka 
University) and B. Beutler (The Scripps Research Institute). Tnfα−/− and 
Il6−/− mice were from the Jackson Laboratory. All strains used in metastasis 
studies were backcrossed to C57BL6 mice more than ten times. Mice were 
maintained under specific pathogen-free conditions, and experimental protocols 
were approved by the University of California, San Diego Animal Care 
Program, following National Institutes of Health guidelines. LLC, 1C1C7, 
TrampC1 and 4T1 cells were from the American Type Culture Collection. 
DsRed-LLC cells were provided by R. Hoffman (Anti-Cancer, Inc.). The cells 
were cultured under standard conditions, except for collection of conditioned 
media, and were confirmed to be mycoplasma free. 

Metastasis assays. Formation of lung and liver metastases by cancer cells inoculated 
through the tail vein was examined as described. Briefly, subconfluent 
LLC cells were harvested and passed through a 40-µm cell strainer, washed three 
times in PBS, re-suspended in PBS and inoculated at 2 × 10° or 1 × 10° cells per 
7- to 8-week-old mouse through the tail vein. At designated time points, mice 
were killed, and their lungs were removed, weighed and histologically examined. 
Some mice were kept until death and survival data were obtained. Lung tumour 
nodules were microdissected using an 18G needle under a microscope for 
protein and RNA analysis. Alternatively, the entire tumour-bearing lung was 
used. Subcutaneous cancer cell implantation was as described. Briefly, 2 × 10° 
control or versican-silenced LLC cells were subcutaneously injected into the right 
flank. When necessary, the primary tumour was surgically removed using aseptic 
techniques at 3–4 weeks after inoculation, and lung, liver and adrenal gland 
metastases were analysed 2–3 weeks after removal of the primary tumour. 
Bone marrow transplantation. C57BL6 mice were lethally irradiated (1050 rad) 
and transplanted with 1 × 10° bone marrow cells from WT or Tlr2−/− mice. 
DsRed visualization, immunohistochemistry, histological analysis and flow 
cytometry. Tissues were immediately frozen in OCT compound (Tissue-Tek) 
and post-fixed with acetone. Sections were mounted with Vectashield containing 
DAPI (4′,6-diamidino-2-phenylindole), and observed under a fluorescent 
microscope (Zeiss) to detect DsRed-labelled cells. The sections were also incubated 
with FITC-labelled antibodies to CD11b, CD11c and CD3 (all from 
Pharmingen) to detect inflammatory cells. Histological analysis of M2 macrophages 
was conducted as described. Briefly, tumour-bearing lungs were fixed in 
10% neutral buffered formalin. Paraffin-embedded lung sections (5 µm) were 
kept at 60 °C for 1 h, and then incubated at 94 °C for 40 min in Dako’s antigen 
retrieval solution S2368. Slices were stained with primary antibodies against 
IL-10 (Santa Cruz Biotechnology) and F4/80 (Caltag) for 1 h at room temperature. 
A specific signal was detected using the appropriate Dako EnVision System, HRP 
(DAB) kit. CD11b+/Gr1+ myeloid suppressors or inflammatory monocytes 
present in lungs were analysed by flow cytometry using CD11b-Alexa 647 and 
Gr1-PE-Cy7 (eBioscience) antibodies and an ACCURI flow cytometer. 
Conditioned media. Conditioned medium was collected from cells incubated in 
SFM for 20 h and filtered through a 0.22-µm filter. Conditioned medium 
samples were added to BMDM for 16–20 h, after which inflammatory cytokine 
gene expression or secretion was assayed. Conditioned medium (300 µl) was 
injected intraperitoneally three times, once every three days, into WT mice that 
had received WT or Tlr2−/− bone marrow transplants 6–7 weeks before being 
inoculated with LLC cells. After 27 days, mice were killed and lungs were 
removed and histologically analysed by staining sections with Harris haematoxylin 
and eosin. 

Analysis of gene expression. Total tissue RNA was prepared using the RNeasy kit 
(Qiagen). Quantitative PCR was performed as described. Primer sequences for 
Tnfα, Il1β, Il6 and Cph and RT-PCR conditions were as previously described. 
Primer pairs for Mip1α/Ccl3, Mip1β/Ccl4, Mip2/Cxcl2 and Kc/Cxcl1 were 
5′-TCTGCCGGTTTCTCTTAGTCA-3′/5′-ACCATGACACTCTGCAACCA-3′, 
5′-CTGCCTCTTTTGGTCAGGAA-3′/5′-TTCTGTGCTCCAGGGTTCTC-3′, 
5′-CTAGCTGCCTGCCTCATTCTAC-3′/5′-CAACAGTGTACYYACGCAGACG-3′ and 
5′-CTTGGGGACACCTTTTAGCA-3′/5′-GCTGGGATTCACCTCAAGAA-3′. 

Cells and tumour tissues were lysed and analysed by SDS–polyacrylamide gel 
electrophoresis and immunoblotting with antibodies to p38, IκBα, procollagen 
III-α1, laminin-β1, tubulin (all from Santa Cruz Biotechnology), ERK, P-ERK, 
P-p38, P-IKKα/β (all from Cell Signalling, Inc.), versican V1 (Abcam), actin 
(Sigma) and hsp90 (Pharmingen). IL-6 and TNF-α were measured by 
enzyme-linked immunosorbent assay (R&D and Pharmingen). 

Lentiviral and retroviral transduction. siRNAs to mouse versican V1, 
laminin-β1 and procollagen III-α1 mRNAs were generated as described, cloned into 
pLSLPw, provided by I. Verma (The Salk Institute), and transfected into 293T 
cells. siRNA sequences for indicated genes were as follows: versican V1, 
GTACACAGTTGATGAAATA and TATTTCATCAACTGTGTAG; laminin-β1, 
ACAGATACTTCGCCTACGA and TCGTAGGCGAAGTATCTGT; procollagen 
III-α1, CCAGAACCATGTCAAATAT and ATATTTGACATGGTTCTGG. 
Human versican V1 vector was provided by D. Zimmermann (University of 
Zurich) and transfected into GP2-293T cells (Clontech). Lentivirus and 
retrovirus stocks were prepared as described. Virus-containing supernatants were 
incubated with LLC cells for 2 days with polybrene, and infected cells were 
selected in 5 µg ml−1 puromycin (Invitrogen) for shRNA or 10 µg ml−1 Zeocin 
(Invitrogen) for versican expression. 
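
Each siRNA target above is listed as a pair of 19-nucleotide sequences, and a sense/antisense pair should be reverse complements of each other. A small check of that relationship is sketched below. The helper names are illustrative and not part of any published pipeline.

```python
# Translation table mapping each DNA base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_antisense_pair(sense, antisense):
    """True if the second sequence is the exact reverse complement of the first."""
    return reverse_complement(sense) == antisense.upper()


# Laminin-β1 pair as listed in the text.
laminin_pair = ("ACAGATACTTCGCCTACGA", "TCGTAGGCGAAGTATCTGT")
laminin_ok = is_antisense_pair(*laminin_pair)
```

The same call can be applied to the other listed pairs as a quick transcription check before ordering oligonucleotides.
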

Protein purification. Protein purification was conducted using an ÄKTA FPLC 
(GE Healthcare) system. Initially, samples were loaded onto a Mono-Q HR 5/5 
column (Amersham Bioscience) that was developed with a gradient of 0–1 M 
NaCl in 20 mM Tris-HCl pH 8.0 buffer (flow rate 1 ml min−1). Fractions possessing 
IL-6-inducing activity were collected for further analysis and separation on a 
HiLoad 16/60 Superdex 200 gel filtration column (GE Healthcare) eluted 
with 20 mM Tris-150 mM NaCl pH 8.0 buffer. Molecular masses were estimated 
by calibrating the column with a molecular mass marker kit (Sigma). Active 
fractions were pooled and analysed by SDS-polyacrylamide gel electrophoresis 
and mass spectrometry. 
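
Column calibration of this kind is commonly done by fitting log10(molecular mass) against elution volume for the marker proteins and reading unknowns off the fitted line. The text does not give the fitting procedure, so the sketch below is an assumption, and the marker values are invented for illustration rather than taken from the Sigma kit.

```python
from math import log10


def fit_calibration(standards):
    """Least-squares line through (elution volume, log10 mass) points.

    standards: list of (elution_volume_ml, mass_kda) tuples."""
    xs = [v for v, _ in standards]
    ys = [log10(m) for _, m in standards]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x
    return slope, intercept


def estimate_mass_kda(volume_ml, slope, intercept):
    """Estimate molecular mass (kDa) from elution volume using the fitted line."""
    return 10 ** (slope * volume_ml + intercept)


# Hypothetical markers: larger proteins elute earlier from a sizing column.
markers = [(50.0, 1000.0), (60.0, 100.0), (70.0, 10.0)]
slope, intercept = fit_calibration(markers)
mass_at_60 = estimate_mass_kda(60.0, slope, intercept)
```

Estimates are only meaningful within the volume range spanned by the markers; extended molecules such as proteoglycans can also elute earlier than globular standards of the same mass.
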

Mass spectrometry. Samples were separated on a 1D NuPage 4–12% Bis-Tris 
gradient gel (Invitrogen) in MOPS buffer (Invitrogen) and stained with GelCode 
Blue. Protein bands were excised and in-gel digested with trypsin. The resulting 
peptides were extracted and separated on an LC Packings liquid chromatographic 
system equipped with a 75-µm C18 column at a flow rate of 300 nl min−1. The 
column was interfaced through a nanospray source directly into a QSTAR XL 
qQTOF mass spectrometer. A 1-s survey scan was followed by 3-s product ion 
scans for the two most abundant parent ions. The resulting mass spectrometry 
data were analysed using MASCOT searching of the Swiss-Prot database. The 
MASCOT search was performed allowing one miscleavage and common 
modifications of methionine oxidation and amino-terminal pyroglutamic acid. 
Statistical analyses. Results are expressed as means ± s.e.m. or s.d. Data were 
analysed by Student’s t-test and two-way analysis of variance using the GraphPad 
Prism statistical program. P values < 0.05 were considered significant. Error bars 
depict s.e.m. or s.d. 
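
For reference, the two error measures reported throughout differ only by a factor of √n: the s.d. describes the spread of the measurements, and the s.e.m. describes the uncertainty of their mean. A minimal sketch with hypothetical replicate values:

```python
from math import sqrt
from statistics import mean, stdev


def summarize(values):
    """Return n, mean, sample s.d. and s.e.m. for a list of replicates."""
    n = len(values)
    sd = stdev(values)  # sample standard deviation (n - 1 denominator)
    return {"n": n, "mean": mean(values), "sd": sd, "sem": sd / sqrt(n)}


# Hypothetical triplicate measurement.
summary = summarize([1.0, 2.0, 3.0])
```
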
Chaperonin complex with a newly folded protein 
encapsulated in the folding chamber 


D. K. Clare, P. J. Bakkes, H. van Heerikhuizen, S. M. van der Vies & H. R. Saibil 


A subset of essential cellular proteins requires the assistance of 
chaperonins (in Escherichia coli, GroEL and GroES), double-ring 
complexes in which the two rings act alternately to bind, encapsulate 
and fold a wide range of nascent or stress-denatured 
proteins. This process starts by the trapping of a substrate protein 
on hydrophobic surfaces in the central cavity of a GroEL 
ring. Then, binding of ATP and co-chaperonin GroES to that 
ring ejects the non-native protein from its binding sites, through 
forced unfolding or other major conformational changes, and 
encloses it in a hydrophilic chamber for folding. ATP hydrolysis 
and subsequent ATP binding to the opposite ring trigger 
dissociation of the chamber and release of the substrate protein. 
The bacteriophage T4 requires its own version of GroES, gp31, 
which forms a taller folding chamber, to fold the major viral capsid 
protein gp23 (refs 16–20). Polypeptides are known to fold inside 
the chaperonin complex, but the conformation of an encapsulated 
protein has not previously been visualized. Here we present structures 
of gp23-chaperonin complexes, showing both the initial 
captured state and the final, close-to-native state with gp23 encapsulated 
in the folding chamber. Although the chamber is 
expanded, it is still barely large enough to contain the elongated 
gp23 monomer, explaining why the GroEL-GroES complex is not 
able to fold gp23 and showing how the chaperonin structure distorts 
to enclose a large, physiological substrate protein. 

Chaperonin-substrate (binary) complexes were formed by rapidly 
mixing urea-denatured gp23 with GroEL. Ternary complexes were 
generated by adding gp31 (bacteriophage T4 GroES homologue) and 
the ATP transition state analogue ADP-AlF3. Cryo-electron microscopy
(cryo-EM) data sets of 30,000-35,000 particles were collected
of both preparations, and initial three-dimensional maps were 
obtained by treating each data set as a single structure with seven- 
fold symmetry. The resulting maps showed GroEL and GroEL-gp31 
complexes with some additional densities in the binding cavities 
(Supplementary Fig. 1). As in our earlier study on malate dehydro- 
genase (MDH) folding, we expected the non-native substrate to form 
heterogeneous and asymmetric complexes with the chaperonins. 
Therefore we used a combination of multivariate statistical analysis 
(MSA) and competitive projection matching to sort the images into 
more homogeneous classes and determine their structures, without 
imposing any symmetry.

The binary complexes were resolved into five classes 
(Supplementary Fig. 2, Supplementary Table 1). In agreement with 
our mass spectrometry results, we observed empty GroEL (20% of
the images) and GroEL with extra density in either one (40%) or both 
rings (40%). The classes displaying the largest fraction of the substrate 
density are shown in Fig. 1, with the GroEL subunit domains fitted 
into the maps. The GroEL rings deviate little from seven-fold sym- 
metry (Fig. 1c, d, g, h and rotational correlation analysis, not shown). 


This is unlike GroEL-MDH complexes in which the GroEL apical 
domains are bunched together on the side of the ring where MDH 
binds. The large gp23 densities contact at least five of the seven
GroEL apical domains in the ring, and the density is located deep 
inside the cavity, mainly around helix I and the underlying hydrophobic
segment (Fig. 1b, c, f-h). Whether substrate is bound or not,
the open rings are very similar to one another and to apo GroEL. 
Most of the ternary complexes fell into three well-defined struc- 
tural classes (Fig. 2, Supplementary Table 1): ‘empty’ (no apparent 
substrate density, 33% of the images), substrate bound in the trans 
(open) ring (32%), and substrate bound in both cis and trans rings 
(35%). Consistent with the presence of the ATP analogue, the trans 
ring of all three complexes displayed the same intra-ring β-sheet
contact between neighbouring equatorial domains as in the
GroEL-GroES-ATP complex, although there is some variation
in orientation of the apical domains. Less substrate density is resolved 
in the trans ring than in the binary complexes but it is found in a very 
similar location, that is, deep in the cavity and interacting with most 
of the apical domains (Fig. 2e, f, h and i). In the cis chamber, the 
substrate density fills most of the space, without strong contacts to 
either GroEL or gp31 (Fig. 2h). At lower density thresholds, the cis 
substrate density extends to contact several regions of the gp31 lid 
and the GroEL apical domains, potentially exerting force on the 
chamber (not shown). The atomic structure of gp23 is not known, 
but sequence alignment and genetic analysis have shown that it is 
closely related to the T4 vertex protein, gp24, for which the structure 
has been determined. The encapsulated gp23 density is remarkably
similar in shape to the major domain of gp24 (Fig. 3c). 
Unexpectedly, we found that the cis chamber of the trans-occupied 
complex is slightly compressed, estimated as a 7% decrease in 
volume, and is significantly expanded in the cis/trans occupied com- 
plex, with a 12% increase in volume, relative to the cis chamber in the 
empty complex. This can be seen by the differences in density of the 
mobile loops of gp31, which interact with the apical domains of 
GroEL (arrows, Fig. 3a-c), and in the cross-sections through the cis
apical domains, which are almost losing contact at some positions
(Fig. 3d-f). The expansion suggests that the folding substrate is
exerting pressure on the folding chamber. No density is visible for the
small, mobile insertion domain of gp23, but there is space at the base
of the chamber to accommodate it in a disordered form (Fig. 3c). The
compression of the folding chamber in the trans-bound complex 
reveals an allosteric effect of the trans substrate, and suggests a mech- 
anism for newly bound substrates to prime dissociation of the folding 
chamber. The nature of this allosteric effect was surprising, because 
the connections between GroEL and gp31 are stronger, rather than 
weaker as expected in preparation for the release of gp31. 
Extracting the gp23 densities from the maps revealed remarkably 
well-defined structures for both the initially captured, non-native 
Figure 1 | Asymmetric reconstructions of GroEL with non-native gp23 in
one or both rings. a-d, Binary complex with gp23 in one ring (red density),
with the crystal structures of GroEL domains fitted into the maps, shown
from the side (a), as a central section (b), from the top (c) and from the
bottom (d). e-h, Same views of the binary complex with gp23 in both rings
(red density). The EM density maps were sharpened between 20 and 10 Å.
Automated docking of the atomic coordinates of the 42 GroEL domains as 
rigid bodies into each complex gave excellent fits to the maps, with hinge 




Figure 2 | Asymmetric reconstructions of GroEL-gp31 without visible 
substrate, with gp23 in the open ring and with gp23 in both rings. 

a-c, Empty ternary complex, with the crystal structures of GroEL and gp31
fitted as 49 individual domains, shown from the side (a), as a central section
(b) and from the bottom (c). d-f, Same views of the trans-bound ternary
complex. g-i, Same views of the cis/trans bound ternary complex. Density of
gp23 in the trans rings is shown in red and in the cis ring in green. The
resolution of the maps is around 10 Å at 0.5 Fourier shell correlation, and
they were therefore sharpened between 20 and 10 Å. The GroEL and gp31
domain coordinates fit very well into the density maps, with only a few 
minor mismatches. 
residues of neighbouring domains in proximity to each other, except for a
few regions such as the intermediate domains of some subunits. Helices H
and I are shown as cyan cylinders. The carboxy termini of some subunits are
visible and either contact the substrate (b, upper ring) or bend away from it
(f, lower ring). Interaction of the flexible GroEL C termini with substrates is
consistent with earlier reports. The diameter of GroEL complexes is
around 140 Å and the resolution of the maps is around 11 Å at 0.5 Fourier
shell correlation. 


form and the almost fully folded state in the folding chamber of the 
GroEL-gp31 complex (Fig. 4). The cryo-EM density for gp23 bound 
to the top rings of the different binary complexes represents over half 
of the native molecular volume (Fig. 4a, b), showing that the non- 
native protein is constrained in position and shape when bound to 
the chaperonin complex (disordered regions are not seen in these 
averaged structures). The substrate densities in the second ring of 
doubly occupied complexes (bottom rings) are less well defined, 
possibly because one substrate dominates the alignment and classi- 
fication, and the second substrate is not necessarily in a fixed position 
relative to the first. In the trans occupied ternary complex, the obser- 
vation of less substrate density suggests that it is less well ordered in 
this class (Fig. 4c). In the cis complex, the encapsulated density is 
clearly recognizable as a low resolution version of the major domain 
in the gp24 crystal structure (Fig. 4d, e). 

A recent FRET (fluorescence resonance energy transfer) study 
shows that another large, stringent GroEL substrate, Rubisco, occu- 
pies a more compact conformation in the trans ring of a ternary 
complex than in the open ring of a binary complex. Our findings
do not show a more compact conformation for gp23 in the trans ring
of the ternary complex. Moreover, our mass spectrometry data show
that Rubisco binds to GroEL with strong negative cooperativity
between the rings, but that gp23 binds to both rings. Therefore,
these two large substrates may have distinct modes of interaction 
with GroEL. A diversity of GroEL-substrate interaction modes can 
be anticipated, depending on the folding pathways and intermediates 
of different substrate proteins. 

It is estimated that all of the GroEL in E. coli would be required to 
fold the large number of gp23 subunits produced during T4 infection.
It has been proposed that gp23 monopolizes the cell’s GroEL
through specific amino-terminal regions of gp23 that pause its translation
and target the nascent chain to GroEL. Our observation of
well-defined electron density for the large domain of gp23 bound to 
GroEL, in a relatively small number of structural classes, is consistent 
with the notion that there is a specific binding region on gp23 that 
targets it to GroEL, leading to a preferred mode of binding. 

In our previous study of GroEL-MDH complexes, most of the 
MDH density was observed in a similar region of the GroEL surface 
Figure 3 | Folding chambers of the GroEL-gp23-gp31 complexes. a-c, Side
view sections and d-f, cross-sections through the apical domains of the
folding chambers (at the position of the red dotted lines in a-c). Shown are
views of the empty complex (a and d), the trans-occupied complex (b and
e) and the cis/trans occupied complex (c and f). The red arrows in a-c show
the loss of density at the contact between gp31 and GroEL. Correspondingly,
the dotted red circles in d-f are all the same size (45 Å in diameter) and


to what we report here for gp23, namely the lower part of helix I and 
the underlying segment. But MDH (33 kDa) only interacts with
three of the seven apical domains, whereas gp23 (56 kDa) contacts
at least five of the seven sites and occupies more of the binding cavity
than MDH. The more asymmetric interaction of MDH with the
cavity appears to perturb the GroEL symmetry in both rings. The
internal location of the substrate around helix I and the underlying 
segment leaves the helix H/I groove accessible to the mobile loop of 
the co-chaperonin even with a large substrate protein bound in the 
open cavity. However, it was reported that glutamine synthetase 
(51 kDa) occupies a more external binding site. It is possible that
Figure 4 | Substrate densities isolated from the binary and ternary
complexes. Shown are top and side views of substrate densities isolated
from the GroEL-gp23 binary complex (a), the GroEL-(gp23)2 binary
complex (b), the trans-only GroEL-gp23-gp31 ternary complex (c) and the
cis ternary complex (d) compared to the low resolution filtered density of the
gp24 crystal structure (e). The isolated substrate densities were low-pass
filtered at 15 Å and their approximate molecular masses were determined at
a density threshold of 1σ of the complete complex. A larger observed mass in
the class average reflects a more homogeneous class and therefore a more 
consistent structure for that sub-population. The green line in e indicates the 
major domain of gp24. 
highlight the expansion of the apical domain ring in the cis/trans complex
and the contraction of the trans-occupied apical domain ring. The major 
domain of the gp24 structure fits very well into the cis substrate density 
(c, f), unlike the mobile insertion domain of gp24 in the extended 
conformation seen in the crystal structure, where it makes an inter-subunit 
contact. In this position it clashes with the GroEL C termini, but there is 
clearly space available for it closer to the major domain. 


the external binding site preferentially binds more folded states, similar
to that suggested for one class of MDH complexes.

Once gp23 is bound to GroEL, gp31 (but not GroES) can encapsulate
and fold the capsid protein. The inability of GroES to encapsulate
gp23 can be explained by the elongated shape adopted by newly
folded gp23 inside the GroEL-gp31 chamber, and implies that the 
folding intermediate formed upon ATP and co-chaperonin binding 
is too bulky or too extended for GroES to bind. In this respect, it 
should be noted that gp31 has longer mobile loops and a larger 
internal space, which we show here to be stretched to the limit in 
order to encapsulate gp23. 

The present work gives the first visualization of a newly folded sub- 
strate protein trapped in a largely native conformation inside the 
folding chamber of GroEL. The well-defined gp23 density in the 
GroEL-gp31 cis chamber unexpectedly shows that this physiological 
substrate protein is trapped in a unique position and orientation in the 
chaperonin chamber. The combination of size and shape of the cis 
chamber and gp23 results in the gp23 major domain being wedged 
into the chamber in a restricted position. The small insertion domain, 
which makes inter-subunit contacts in the viral capsid, is likely to be
mobile and disordered until assembly of the capsid hexamers, which 
form as soon as the folded gp23 is released from GroEL-gp31 (ref. 22). 
The encapsulated substrate causes a significant expansion of the folding 
chamber in the region of the apical domains, and is exerting pressure on 
the connections between GroEL and gp31 and the inter-subunit con- 
tacts between apical domains of the cis ring. Thus, even though the 
folding chamber formed by GroEL-gp31 is larger than that formed by 
GroEL-GroES, an enclosed gp23 monomer still exerts pressure on it. 
Previous studies of GroEL-GroES and GroEL-gp31 complexes in various
nucleotide states did not reveal any significant changes in the
conformation of the folding chamber (see also Supplementary
Results and Supplementary Discussion). The expansion and compres- 
sion observed here show that folding substrates directly and indirectly 
affect the conformation of the chamber. In conclusion, this study 
reveals a remarkable view of the chaperonin folding chamber strained 
to encapsulate a physiological substrate protein. 
METHODS SUMMARY


The GroEL, gp23 and gp31 were expressed and purified as described previously. The
binary GroEL-gp23 complexes were prepared as described previously, yielding final
concentrations of 1 µM GroEL oligomer, 2.5 µM gp23 monomer and 100 mM urea.
GroEL-gp23-gp31 ternary complexes were formed by adding a twofold molar
excess of gp31 heptamer over GroEL tetradecamer to preformed binary
complexes, along with ADP to give a final concentration of 2.5 mM. After incubation
at 24 °C, KF and KAl(SO4)2 were added, giving final concentrations of 20 mM KF
and 2 mM KAl(SO4)2, to form the ATP transition state analogue ADP-AlF3 to
generate folding-active complexes.

Images were recorded on a 200 kV FEG microscope on photographic film and 
processed at 2.8 Å per pixel, with final data sets of 30,000 and 35,000 side views of
the binary and ternary complexes, respectively. A starting model for the binary 
complex was obtained by angular reconstitution, and our previously determined 
GroEL-ADP-gp31 structure was used as a starting model for the ternary
complexes. The data sets were sorted into classes showing different substrate features
by a combination of MSA and competitive projection matching, and the
atomic structures of the GroEL subunit domains, gp31 and gp24 subunits were 
docked into the final, asymmetric maps as separate rigid bodies. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS

Sample preparation. Binary complexes. GroEL was diluted to a concentration 
of 1.2 µM (oligomer) in 40 mM Tris-HCl pH 7.5, 10 mM KCl, 10 mM MgCl2.
25 µM gp23 monomer was denatured in 6 M urea for 2 h at 24 °C and 300 r.p.m.
on a bench top shaker. The denatured gp23 was then added to 90 µl of GroEL
solution in ten 1-µl additions, with vortexing and centrifugation after each
addition, giving a final concentration of 1 µM GroEL, 2.5 µM gp23 and
600 mM urea. To maximize the occupancy of the gp23-GroEL bound complex,
the denatured gp23 was added in a 2.5-fold molar excess over GroEL oligomer. The
urea was diluted to 100 mM by addition of 500 µl of buffer to the GroEL-gp23
complex. This mixture was then concentrated in a Vivaspin with a 5 kDa cut-off,
to a final concentration of 1 µM GroEL oligomer, 2.5 µM gp23 monomer and
100 mM urea. 
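
The concentrations quoted for the binary complex follow from simple dilution arithmetic (C1 V1 = C2 V2). The sketch below is an illustrative check only, assuming the volumes are additive and that the stock values are those read from the text (25 µM gp23 in 6 M urea, 1.2 µM GroEL); the helper name `dilute` is hypothetical.

```python
def dilute(stock_conc, stock_vol, final_vol):
    """C1*V1 = C2*V2: concentration after bringing stock_vol of stock to final_vol."""
    return stock_conc * stock_vol / final_vol


# Volumes in microlitres, taken from the text.
gp23_added_vol = 10 * 1.0                 # ten 1-µl additions
groel_vol = 90.0
mix_vol = groel_vol + gp23_added_vol      # 100 µl after mixing

gp23_uM = dilute(25.0, gp23_added_vol, mix_vol)     # gp23 monomer, µM
urea_mM = dilute(6000.0, gp23_added_vol, mix_vol)   # urea, mM (6 M stock)
groel_uM = dilute(1.2, groel_vol, mix_vol)          # GroEL oligomer, µM

# Adding 500 µl of buffer dilutes the urea sixfold.
urea_after_buffer_mM = dilute(urea_mM, mix_vol, mix_vol + 500.0)
```

Under these assumptions the mixture contains 2.5 µM gp23, 600 mM urea and about 1.1 µM GroEL, and the buffer addition brings the urea to 100 mM, in line with the values stated.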

Ternary complexes. The non-hydrolysable ATP analogue ADP-AlF3 has been
shown to produce folding-active ternary complexes for several GroEL
substrates. We therefore used it to produce folding-active GroEL-gp23-
gp31:ADP-AlF3 complexes. 0.75 µl of 140 µM (heptamer) gp31 was added to
50 µl of binary complex prepared as above, followed by 0.3 µl of 400 mM
ADP. The mixture was incubated for 10 min at 24 °C. Then 1 µl of 1 M KF was
added and mixed, followed by 1 µl of 100 mM KAl(SO4)2, mixing and 5 min
incubation. To maximize the number of ternary complexes, a twofold molar 
excess of gp31 over GroEL was added to the preformed binary complexes. 
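
The stated twofold molar excess and the nominal final concentrations can be checked from the pipetting volumes. This is a rough sketch under stated assumptions: volumes are treated as additive, the binary complex is taken as 1 µM GroEL oligomer, and all reagent concentrations are evaluated at the final volume. The variable names are hypothetical.

```python
def amount_pmol(conc_uM, vol_ul):
    """Amount of substance in pmol (µM x µl = pmol)."""
    return conc_uM * vol_ul


gp31_pmol = amount_pmol(140.0, 0.75)    # gp31 heptamer added
groel_pmol = amount_pmol(1.0, 50.0)     # GroEL oligomer in the binary complex
molar_excess = gp31_pmol / groel_pmol   # gp31 heptamer per GroEL oligomer

# Final volume after all additions (µl): complex + gp31 + ADP + KF + KAl(SO4)2.
final_vol = 50.0 + 0.75 + 0.3 + 1.0 + 1.0

adp_mM = 400.0 * 0.3 / final_vol        # ADP, mM
kf_mM = 1000.0 * 1.0 / final_vol        # KF, mM (1 M stock)
alum_mM = 100.0 * 1.0 / final_vol       # KAl(SO4)2, mM
```

With these inputs the molar excess is about 2.1, and the calculated concentrations fall slightly below the nominal 2.5 mM ADP, 20 mM KF and 2 mM KAl(SO4)2 because of the added volume.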
Data collection. Samples (3.5 µl) were loaded on to freshly glow discharged
(45 s) c2/2 holey carbon C-flat grids (Protochips Inc.), blotted and plunged into
liquid nitrogen cooled liquid ethane. Low dose images were recorded on an F20
microscope (FEI) at a magnification of 50,000 on Kodak SO-163 film with a defocus range of
1-3 µm.

The images showed an even distribution of side and end views very similar to 
that seen for GroEL alone. The number of side views is crucial to the study of 
GroEL complexes as they contain all the information required to reconstruct the 
molecule in three dimensions. In contrast, for the GroEL-MDH binary complex, 
a modified GroEL had to be used to increase the number of side views.
Image processing. The images were digitized at a step size of 7 µm on a Zeiss
photodensitometer (Intergraph), giving a sampling of 1.4 Å per pixel. The
defocus was determined with CTFFIND3. Particles were manually picked with
XIMDISP and extracted into 512 × 512 pixel boxes with LABEL. The cut-out
particles were corrected for the phase reversals of the contrast transfer
function (CTF) in SPIDER, cropped to 320 × 320 pixels and 2 × 2 averaged to give a
final sampling of 2.8 Å per pixel. The boxed particles were filtered between 200
and 6 Å for the binary complexes and between 285 and 6 Å for the ternary
complexes and then normalized. The images were centred in SPIDER using side
and end view projections of the crystal structures of GroEL and GroEL-GroES
that were low pass filtered to 40 Å. The aligned images were then classified using
MSA in IMAGIC into classes each containing on average five images. Using
these classes, end views, bad images and GroEL lacking gp31 in the ternary 
complex sample were removed to produce two data sets containing 30,000 
and 35,000 side view images of the binary and ternary complexes, respectively. 
The binary data set, including an end view class, was aligned and initially recon- 
structed by angular reconstitution in IMAGIC with imposed seven-fold sym- 
metry, treating it as a single structure. 
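
The pixel samplings quoted in this section follow from the scanner step size and the magnification. The sketch below is illustrative only; it assumes that the figure of 50,000 given under Data collection is the magnification, and the function name is hypothetical.

```python
def angstrom_per_pixel(step_um, magnification, binning=1):
    """Specimen-level sampling: scanner step divided by magnification, times binning."""
    step_angstrom = step_um * 1e4   # 1 µm = 10^4 Å
    return step_angstrom / magnification * binning


scan_sampling = angstrom_per_pixel(7.0, 50_000)        # as digitized
binned_sampling = angstrom_per_pixel(7.0, 50_000, 2)   # after 2 x 2 averaging

# A 320-pixel box averaged 2 x 2 becomes 160 pixels across.
box_width_angstrom = (320 // 2) * binned_sampling
```

This gives 1.4 Å per pixel as digitized and 2.8 Å per pixel after averaging, so the final box spans roughly 450 Å, comfortably larger than the ~140 Å diameter of the GroEL complex.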

Binary complexes. The initial map was used as a starting model for projection 
matching in SPIDER, with the number of references increasing to 155 (90-105°
in β and 0-51.4° in γ) which gave a γ spacing of 2.1°. The resulting reconstruction
was masked to generate two reference structures, one without substrate density
and the other containing one-third of the sevenfold averaged substrate density.
At this stage the symmetry was relaxed from C7 to C1 and the reconstructions
were used for competitive projection matching, in which the images are sorted by 
cross-correlation value to projections of both reference structures. The images 
that aligned to the substrate-bound reference were then subjected to MSA, and
two eigenimages that represented substrate variance were used to separate the
images into two populations of substrate-bound structures, GroEL-gp23 and
GroEL-(gp23)2, which, along with the empty structure, were used for further
competitive projection matching. 
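
The sorting step in competitive projection matching can be sketched in a few lines. This is a simplified illustration, not the SPIDER procedure used here: images and projections are flat lists of pixel values, no rotational or translational search is performed, and the function names are hypothetical. Each image is assigned to the reference structure whose best-matching projection gives the highest normalized cross-correlation.

```python
import math


def ncc(a, b):
    """Normalized cross-correlation of two equal-length pixel lists (no shift search)."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    da = [x - mean_a for x in a]
    db = [y - mean_b for y in b]
    num = sum(x * y for x, y in zip(da, db))
    den = math.sqrt(sum(x * x for x in da) * sum(y * y for y in db))
    return num / den if den else 0.0


def competitive_match(images, references):
    """Sort images by their best cross-correlation to each reference structure.

    references: dict mapping a structure name to a list of its projections.
    Returns a dict mapping each structure name to the indices of its images.
    """
    classes = {name: [] for name in references}
    for idx, img in enumerate(images):
        best = max(references,
                   key=lambda name: max(ncc(img, p) for p in references[name]))
        classes[best].append(idx)
    return classes
```

In practice the comparison would run over all projection angles and in-plane alignments of each reference, and the resulting classes would be reconstructed and used as references for the next round.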

For the asymmetric refinement, the number of reference images was increased
from 155 to 528 per reconstruction for further competitive alignment with an
angular spacing of 3° from 78-90° in β and 3.1° from 0-360° in γ and with the
mirror option turned on in the SPIDER operation AP SH. After another three
rounds of alignment, when the image alignments were stable for each of the three
species, they were subjected to MSA. This led to a further separation to give one
empty, two GroEL-gp23 and two GroEL-(gp23)2 species. The separation was
refined again by projection matching. When the alignment had stabilized, 95% 
of the images aligned to the same references in consecutive alignments. MSA was 
carried out on each of the subsets to check that the substrate variations had been 
minimized (Supplementary Fig. 3a—c). The total number of images in the data set 
limited the number of different complexes that could be discriminated and 
therefore no more separations were carried out. The resolutions of the five final 
reconstructions, determined by the 0.5 criterion of the Fourier shell correlation, 
were 11-12 Å.
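
The stability criterion described above (95% of the images aligning to the same references in consecutive alignments) can be expressed as a short check. This is a minimal sketch with hypothetical function names; each list holds the reference index assigned to every image in one round of alignment.

```python
def fraction_stable(previous, current):
    """Fraction of images assigned to the same reference in two consecutive rounds."""
    if not previous or len(previous) != len(current):
        raise ValueError("assignment lists must be non-empty and of equal length")
    same = sum(1 for p, c in zip(previous, current) if p == c)
    return same / len(previous)


def has_converged(previous, current, threshold=0.95):
    """True once at least `threshold` of the images keep their reference."""
    return fraction_stable(previous, current) >= threshold
```

A refinement loop would call `has_converged` after each round and stop, or move on to MSA of the subsets, once it returns True.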
Ternary complexes. The initial model used to align the ternary complexes was 
the previously published GroEL-gp31-ADP structure, re-projected at an angular
spacing of 2.1° (90-110° in β and 0-51.4° in γ). The resulting seven-fold
symmetric reconstruction contained density for both cis and trans bound sub- 
strate and was used for further alignment and MSA, eventually yielding five 
classes. Classes 1-3 together represented 70% of the data and contained well 
aligned images, whereas classes 4 and 5 contained poorly aligned images. After 
this separation we continued the alignment procedure with classes 1, 2 and 3 as 
independent data sets. The alignment of these three classes stabilized quickly 
with 95% of the images aligning to the same reference in consecutive rounds, and 
the calculated eigenimages showing that the variation due to substrate had been 
removed (Supplementary Fig. 3d-f). 

Once the alignment had stabilized for each of the complexes the symmetry was
relaxed from C7 to C1 and the number of references increased to give a final
angular spacing of 2° (90-110° in β and 0-360° in γ and with the mirror option
turned on in the SPIDER operation AP SH). The refinement was stopped when
more than 95% of the images aligned to the same reference in consecutive rounds of
alignment. The images of the cis/trans bound complex were also aligned to the 
trans only bound references and the resulting reconstruction had almost ident- 
ical cis substrate density, confirming the reliability of the cis substrate density. 
The resolutions of the three maps were all around 10 Å by the 0.5 Fourier shell
correlation criterion. 
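
Reading a resolution from a Fourier shell correlation (FSC) curve with the 0.5 criterion amounts to finding the spatial frequency at which the curve first falls below 0.5. The sketch below assumes the FSC values have already been computed per shell by other software; the function name and the linear interpolation between shells are illustrative choices, not a description of the program used here.

```python
def resolution_at_fsc(spatial_freqs, fsc_values, threshold=0.5):
    """Resolution in Å at which the FSC first drops below `threshold`.

    spatial_freqs: shell frequencies in 1/Å, in ascending order.
    fsc_values: FSC value for each shell.
    Returns None if the curve never crosses the threshold.
    """
    for i in range(1, len(fsc_values)):
        f0, f1 = spatial_freqs[i - 1], spatial_freqs[i]
        c0, c1 = fsc_values[i - 1], fsc_values[i]
        if c0 >= threshold > c1:
            # Linear interpolation between the two shells bracketing the crossing.
            f_cross = f0 + (c0 - threshold) * (f1 - f0) / (c0 - c1)
            return 1.0 / f_cross
    return None
```

For example, a curve that crosses 0.5 at a spatial frequency of 0.1 per Å corresponds to a resolution of 10 Å.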
Docking of atomic coordinates. As there was no symmetry applied to the 
reconstructions, all 14 subunits of GroEL (1GRL and 1AON) and 7 subunits 
of gp31 (1G31) were fitted individually. Each of the GroEL subunits was further 
split into its three separate domains, apical 192-373, intermediate 137-191 and 
373-409, and equatorial 2-136 and 410-525, which were docked as rigid bodies. 
The mobile loops of gp31 (residues 15-36) were omitted since they were not in 
the GroEL-bound conformation in the crystal structure. The initial docking was 
done manually in Pymol (www.pymol.org) and then refined by the automatic 
docking program URO. However, the fitting of the intermediate domains was
not accurate since the density for these domains in most of the maps was not as 
well defined as that of the other domains. Therefore, the intermediate domains 
had to be checked and manually corrected to maintain the connectivity between 
the equatorial and intermediate domains. After adjustment of the intermediate
domains all the connections between them and the equatorial and apical domains
were within 7 Å (α-carbon distance), with the majority less than that.
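
The connectivity check on the docked domains reduces to measuring α-carbon distances across each domain boundary. The sketch below is illustrative: the coordinates would come from the docked models, the 7 Å limit is the value quoted above, and the function name and example coordinates are hypothetical.

```python
import math


def connections_within(pairs, max_dist=7.0):
    """Check that every pair of Cα coordinates (in Å) lies within max_dist.

    pairs: iterable of ((x, y, z), (x, y, z)) tuples, one per domain boundary.
    Returns (all_ok, distances).
    """
    distances = [math.dist(a, b) for a, b in pairs]
    return all(d <= max_dist for d in distances), distances


# Hypothetical coordinates for two domain boundaries, for illustration only.
example_pairs = [((0.0, 0.0, 0.0), (3.0, 4.0, 0.0)),
                 ((1.0, 1.0, 1.0), (1.0, 1.0, 7.5))]
ok, dists = connections_within(example_pairs)
```

Boundaries that fail the check would be the ones to inspect and correct manually, as was done for the intermediate domains.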
X-ray structure of a pentameric ligand-gated ion
channel in an apparently open conformation 


Nicolas Bocquet, Hugues Nury, Marc Baaden, Chantal Le Poupon, Jean-Pierre Changeux, Marc Delarue
& Pierre-Jean Corringer


Pentameric ligand-gated ion channels from the Cys-loop family 
mediate fast chemo-electrical transduction, but the mechanisms
of ion permeation and gating of these membrane proteins remain
elusive. Here we present the X-ray structure at 2.9 Å resolution of
the bacterial Gloeobacter violaceus pentameric ligand-gated ion
channel homologue (GLIC) at pH 4.6 in an apparently open
conformation. This cationic channel is known to be permanently
activated by protons. The structure is arranged as a funnel-shaped
transmembrane pore widely open on the outer side and lined by
hydrophobic residues. On the inner side, a 5 Å constriction
matches with rings of hydrophilic residues that are likely to
contribute to the ionic selectivity. Structural comparison with
ELIC, a bacterial homologue from Erwinia chrysanthemi solved
in a presumed closed conformation, shows a wider pore where
the narrow hydrophobic constriction found in ELIC is removed.
Comparative analysis of GLIC and ELIC reveals, in concert, a
rotation of each extracellular β-sandwich domain as a rigid body,
interface rearrangements, and a reorganization of the transmembrane
domain, involving a tilt of the M2 and M3 α-helices away
from the pore axis. These data are consistent with a model of pore 
opening based on both quaternary twist and tertiary deformation. 

Pentameric ligand-gated ion channels (pLGICs) are allosteric pro- 
teins regulating cellular excitability through the opening of an 
intrinsic transmembrane ion channel. Agonist binding to extracellu- 
lar sites shifts a closed conformation into an open one, allowing ions 
to diffuse down their electrochemical gradient. Two agonist-free 
structures have been reported: the Torpedo marmorata nicotinic 
acetylcholine receptor (nAChR-Tm), solved by electron microscopy 
at 4 Å resolution, and the 3.3 Å ELIC X-ray structure. X-ray
structures of homologues of the extracellular domain (ECD) of
nAChRs have also been described: the acetylcholine-binding proteins
(AChBPs) co-crystallized with agonists and antagonists, and the
ECD of α1-nAChR. A key issue is now to understand how pLGICs
open, select and translocate ions through the membrane. Most 
pLGICs undergo desensitization on prolonged exposure to agonist, 
complicating structural investigations of the transient open con- 
formation. In contrast, GLIC is activated by protons but does not 
desensitize, even at proton concentrations eliciting the maximal
electrophysiological response (pH 4.5). Here we present the first
apparently open structure of this family, GLIC crystallized at pH 4.6.

The overall architecture of GLIC is similar to those of ELIC, the 
AChBPs, and nAChR-Tm (Fig. 1a). The five subunits are arranged in
a barrel-like manner around a central symmetry axis that coincides
with the ion permeation pathway (Fig. 1b). Subunits interact tightly
through a 2,200 Å² interface containing charged residues and water
molecules (Supplementary Fig. 8). The ECD of each subunit consists
of a β-sandwich composed of five inner and three outer strands
connected by loops (Fig. 1c). In eukaryotic pLGICs, the interface
between ECDs holds the neurotransmitter-binding pocket. The
equivalent region in GLIC presents structural similarity despite a
low sequence identity, and is well defined, notably the capping
β9-β10 loop (loop C). The transmembrane domain (TMD) of each
subunit consists of four helices (M1 to M4). M2 helices form the wall
of the pore (Fig. 2a), bordered by rings of homologous residues, as
previously established in nAChRs: a charged E-2', two polar T2',
S6', and three hydrophobic I9', A13' and I16'/L17' rings, that are
close/homologous to α1-nAChR-Tm E-1', T2', S6', L9', V13' and
L16'/V17' (the prime numbering starts at approximately the
beginning of M2, at positions homologous to α1K242 of nAChR-Tm,
Fig. 2c). M1 helices are kinked at P205 and form with M3 a second 
circle of helices interacting with M2. M4 helices are peripheral. Well- 
defined electron densities are observed in the grooves between M4 
and both M1 and M3, close to residues labelled by hydrophobic 
probes on nAChR-Tm. They were attributed to lipids that possibly
contribute to holding M4 in its position (Fig. 1b, d, and 
Supplementary Fig. 3). The central ion permeation pathway consists 
of an extracellular hydrophilic vestibule more than 12 Å wide, followed
by a funnel-shaped transmembrane pore (Fig. 2d). The M2
axes are tilted with respect to the pore axis, with outer hydrophobic 
side chains oriented towards the helix interfaces, and inner polar side 
chains oriented towards the pore. 

A bundle of six detergent molecules (dodecyl-β-D-maltoside,
DDM) obstructs the pore, with one detergent per monomer and 
one sitting on the five-fold axis, the sugar moiety being much less 
ordered than the aliphatic chains (Fig. 1b, d, Supplementary Fig. 3). 
The former interact extensively with the hydrophobic rings, shielding 
their side chains from the solvent with the polar heads pointing up, 
towards the vestibule, while the latter points its polar head down. To 
probe the influence of the detergent on the protein conformation, 
diffraction data have been collected with crystals grown in the pres- 
ence of two DDM analogues (Supplementary Fig. 4). These detergents 
contain two bulky bromine atoms in their alkyl chain and could not
all bind in the same position as described here (Fig. 1) without
seriously disturbing the pore structure, for steric reasons. In both cases
we observe that the protein conformation is unchanged (r.m.s.
deviation <0.3 Å), whereas either the central detergent is not seen in the
electron density (10,11-dibromoundecanoyl-β-maltoside) or the five
central detergent-binding sites are only partly occupied
(7,8-dibromododecyl-β-maltoside). Furthermore, we performed molecular
dynamics simulations (20 ns) of the GLIC molecule in a lipid bilayer 
without DDM in the pore. We observe that its conformation is stable 
throughout the simulation (Supplementary Fig. 6). 
Figure 1 | GLIC structure. a, Ribbon representation of GLIC viewed from
the plane of the membrane. DDM molecules bound in the channel are 
depicted as yellow sticks plus van der Waals surface. Horizontal lines 
represent the membrane limits. b, Transmembrane part of GLIC viewed 
from the extracellular side. The ECD is removed for clarity. Lipids are also 


The pore of GLIC resembles a cone with 12 Å outside and 5 Å
inside diameters, sharply contrasting with that of ELIC, where outer
residues L9', A13' and F16' point their side chains towards the central
axis, creating a 2 Å diameter hydrophobic barrier expected to
prevent ion permeation (Fig. 2b). This wide opening of the pore is 
suggestive of an open structure. Three lines of evidence support this 
idea. First, GLIC was solved in the presence of a non-desensitizing 
agonist. Second, the GLIC pore structure is consistent with a wealth 
of biochemical data obtained on the open conformation of nAChRs 
that located the channel constriction between residues −2′ and 2′ 
(refs 19, 20), from measurement of cysteine accessibility and channel 
conductance decreases following positively charged residue 
substitution. Yet, permeability studies with organic cations suggest that 
this constriction is 7.5 Å wide in nAChRs, while it is only 5 Å wide in 
GLIC. Intrinsic flexibility of the side chains composing the constric- 
tion could compensate for this difference and transiently allow for 
accommodation of large cations. Third, molecular dynamics simula- 
tions based on the nAChR-Tm confirm that it corresponds to an 
impermeant structure, with rings 9’ and 13’ creating a hydrophobic 
barrier, but show that small increases (less than 2 Å) in diameter 
would produce a permeant channel. The GLIC diameter is significantly 
wider than that of nAChR-Tm at these positions, further 
highlighting its compatibility with cation permeation. 
depicted in orange. c, Topology of a GLIC subunit. The conserved core 
elements common to GLIC and ELIC are coloured in yellow. d, Close-up 
view of the TMD. Only two subunits are represented. The DDM molecules 
and the lipids (named LIP601/2/3) close to these subunits are coloured 
according to their atomic B-factor (colour scale at top). 


Interestingly, mutagenesis has pointed to a key role of the −2′/2′ 
region in determining charge selectivity; a ring of negatively 
charged residues at −1′ and an additional ring of prolines at −2′ 
respectively favour cationic and anionic selectivity of eukaryotic 
pLGICs. Thus, the −2′/2′ region was proposed to constitute the 
selectivity filter of the pore, an idea consistent with our structure, 
where the constriction is formed by the T2′ hydroxyl moieties, 
flanked on both sides by the S6′ hydroxyl and E−2′ carboxyl 
moieties. The 5 Å diameter at T2′ is too narrow to fit a fully hydrated ion, 
as indicated by the average distance between sodium or potassium 
and protein side-chain oxygen atoms (respectively 2.4 and 2.85 Å 
between atom centres). Cation permeation would require the loss 
of equatorial water molecules; yet rings of polar side chains are 
located in the right position to transiently coordinate cations and 
could compensate for this loss. Regarding the pore conformation, 
data are thus consistent with the notion that GLIC and ELIC struc- 
tures are in presumed open and closed conformations, respectively, 
while that of nAChR-Tm might be in a functionally closed but 
structurally nearly open conformation. 
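
The argument that a 5 Å constriction cannot pass a fully hydrated ion can be checked with rough arithmetic. The sketch below is an illustration under stated assumptions, not the authors' calculation: it treats each first-shell oxygen as a sphere of radius 1.4 Å (an assumed, commonly used water probe radius) centred at the ion–oxygen distance quoted in the text, so the hydrated diameter is taken as 2 × (distance + 1.4 Å).

```python
WATER_RADIUS = 1.4  # assumed effective radius of a water oxygen, in angstroms

def hydrated_diameter(ion_oxygen_distance, water_radius=WATER_RADIUS):
    """Approximate diameter of an ion plus its first shell of oxygens."""
    return 2.0 * (ion_oxygen_distance + water_radius)

def fits(pore_diameter, ion_oxygen_distance):
    """True if the hydrated ion, as approximated above, fits the pore."""
    return hydrated_diameter(ion_oxygen_distance) <= pore_diameter

# Ion-oxygen distances quoted in the text (angstroms, between atom centres).
sodium_distance = 2.4
potassium_distance = 2.85
constriction = 5.0  # diameter at T2' reported for GLIC

sodium_fits = fits(constriction, sodium_distance)
potassium_fits = fits(constriction, potassium_distance)
```

Under these assumptions the hydrated diameters come out at roughly 7.6 Å for sodium and 8.5 Å for potassium, both wider than the 5 Å constriction, which is in line with the requirement that some water molecules be shed during permeation.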

Assuming that GLIC and ELIC structures faithfully represent the 
open and closed forms of pLGICs embedded in a phospholipid 
bilayer, insight into the opening mechanism can be provided by 
analysing their rearrangements, even though they share only 18% 

Figure 2 | GLIC and ELIC pores. a, Top and side view of GLIC M2 helices (top 
and bottom panels, respectively); helix backbones and side chains facing the 
pore are depicted. Hydrophobic, polar and negative residues are coloured 
yellow, green and red respectively. In the side view, only two subunits are 
shown. DDM molecules are removed for clarity. b, Top and side views of ELIC 
M2 helices. c, M2 sequence alignment. d, Pore radius for α1-nAChR-Tm 
(blue), ELIC (red) and GLIC (green) along its axis. The upper part is a mesh 
representation at the same scale of the GLIC channel. The electrostatic 
potential is represented as a dotted green line. Black and yellow arrows 
indicate the limits of the membrane and the hydrophobic rings, respectively. 

Figure 3 | Open GLIC and closed ELIC structure comparison. a, Side view of 
the structural superposition. For the two subunits in the foreground, only 
the common core is depicted, in green for GLIC, in red for ELIC. Other 
subunits are in grey. The ECD rotation axes and the twist axis are depicted. 
The M4 helix is omitted for clarity. b, Close-up view of the interface between 
the ECD and the TMD (side view in left panel and upper view in right panel). 
c, Close-up of transmembrane helices M1–M3 viewed from the channel. 


©2009 Macmillan Publishers Limited. All rights reserved 


LETTERS 


sequence identity. Several elements can be unambiguously aligned 
with no gap or insertion, with identical secondary structure 
(Supplementary Fig. 2): helices M1, M2 and M3, and a large portion 
of the β-sandwich consisting of strands β1, β2, β6, β7 and β10, also 
well conserved in the AChBPs. These elements constitute the subunit 
“common core’. Common core superimposition shows that the GLIC 
subunits display a quaternary twist compared to ELIC, with anti- 
clockwise (versus clockwise) rotation in the upper (versus lower) 
part of the pentamer, when viewed from the extracellular compart- 
ment (Fig. 3a). This is confirmed by normal mode analysis: the lowest 
frequency mode is precisely a twist mode and has by far the highest 
contribution (29%) to the transition (Supplementary Fig. 7a). 
However, we note that the first 100 lowest-frequency modes (usually 
the most collective ones) only explain about 50% of the transition. 
Other and more local movements occur: in the TMD, the outer ends 
of M2 and M3 of GLIC are tilted away radially from the channel axis, 
while the outer end of M1 is fixed. The inner ends of M1, M2 and M3 
move tangentially towards the left, when viewed from the membrane 
(Fig. 3b). In the ECD, the core of the β-sandwich undergoes little 
deformation (Supplementary Table 2), but is rotated by 8° around an 
axis roughly perpendicular to the inner sheet of the β-sandwich 
(Fig. 3a), concomitant with a rearrangement of both the 
subunit–subunit and the ECD/TMD interfaces, regions known to contribute 
to neurotransmitter gating. The latter contains the well-conserved 
β6–β7 and M2–M3 loops and the β1–β2 loop whose length 
is conserved in the pLGIC family. We observe a downward motion of 
the β1–β2 loop, concomitant with a displacement of the M2–M3 
loop, M2 and M3 helices and β6–β7 loop towards the periphery of 
the molecule (Fig. 3c), thereby opening the pore. 
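
The statement that a single mode contributes 29% of the transition, and the first 100 modes about 50%, can be read as a projection of the ELIC-to-GLIC displacement onto the normal modes. The text does not give the formula, so the sketch below assumes the common convention of squared overlaps between the normalized displacement and mutually orthogonal, normalized modes; the three-dimensional vectors are toy values chosen only to show the bookkeeping.

```python
import math

def unit(vector):
    """Return the vector scaled to unit length."""
    norm = math.sqrt(sum(c * c for c in vector))
    if norm == 0.0:
        raise ValueError("zero-length vector")
    return [c / norm for c in vector]

def mode_contributions(displacement, modes):
    """Squared overlap of the normalized displacement with each normalized mode."""
    d = unit(displacement)
    fractions = []
    for mode in modes:
        m = unit(mode)
        overlap = sum(a * b for a, b in zip(d, m))
        fractions.append(overlap * overlap)
    return fractions

def cumulative(fractions):
    """Running total of the per-mode fractions, lowest frequency first."""
    running, totals = 0.0, []
    for fraction in fractions:
        running += fraction
        totals.append(running)
    return totals

# Toy example: three orthogonal modes and a hypothetical displacement.
toy_modes = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
toy_displacement = [2.0, 1.0, 1.0]
toy_fractions = mode_contributions(toy_displacement, toy_modes)
toy_cumulative = cumulative(toy_fractions)
```

With a complete orthonormal set the cumulative total reaches 1; a total of about 0.5 after 100 modes, as reported, indicates that half of the transition lies in more local, higher-frequency motions.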

This mechanism is different from a more local gating pathway 
suggested by the comparison of the conformations of the α and 
non-α subunits in nAChR-Tm. It implies both a quaternary twist 
and tertiary deformations. Such twist-to-open motions, initially 
proposed from ab initio normal mode analysis of nAChRs, and 
observed for KcsA, may plausibly be extended to eukaryotic 
pLGICs. The structural transition described here couples in an 
allosteric manner the opening–closing motion of the pore with distant 
binding sites (located at the ECD subunit interface for 
neurotransmitters, or within the TMD for allosteric effectors) and may 
possibly serve as a general mechanism of signal transduction in pLGICs. 


METHODS SUMMARY 


GLIC was expressed as described previously, and solubilized/purified in 2%/ 
0.02% DDM including an amylose affinity chromatography, two size exclusion 
chromatographies and thrombin cleavage of fused MBP. Crystals were grown at 
pH 4.6, with 15% PEG 4000, 100 mM NaOAc and 400 mM (NH4)SCN in hanging 
drops. Data were collected at MX beamlines of the European Synchrotron 
Radiation Facility, and processed with XDS and CCP4 programs 
(Supplementary Table 1). The spacegroup is C2 with one pentamer in the asym- 
metric unit (Supplementary Fig. 1). Molecular replacement with Phaser using 
the ELIC structure gave the first density maps. Refinement was conducted with 
Coot and Refmac, using tight NCS restraints. The final model presents a good 
geometry and consists of residues 6-315 for the 5 subunits plus 6 DDM and 15 
partial lipid molecules, and 115 water molecules (Supplementary Fig. 3). 


Structure of a potentially open state of a 
proton-activated pentameric ligand-gated ion channel 

Ricarda J. C. Hilf¹ & Raimund Dutzler¹ 


The X-ray structure of a pentameric ligand-gated ion channel from 
Erwinia chrysanthemi (ELIC) has recently provided structural 
insight into this family of ion channels at high resolution¹. The 
structure shows a homo-pentameric protein with a barrel-stave 
architecture that defines an ion-conduction pore located on the 
fivefold axis of symmetry. In this structure, the wide aqueous 
vestibule that is encircled by the extracellular ligand-binding 
domains of the five subunits narrows to a discontinuous pore that 
spans the lipid bilayer. The pore is constricted by bulky 
hydrophobic residues towards the extracellular side, which probably 
serve as barriers that prevent the diffusion of ions. This 
interrupted pore architecture in ELIC thus depicts a non-conducting 
conformation of a pentameric ligand-gated ion channel, the 
thermodynamically stable state in the absence of bound ligand. As 
ligand binding promotes pore opening in these ion channels and 
the specific ligand for ELIC has not yet been identified, we have 
turned our attention towards a homologous protein from the 
cyanobacterium Gloeobacter violaceus (GLIC). GLIC was shown to 
form proton-gated channels that are activated by a pH decrease 
on the extracellular side and that do not desensitize after 
activation². Both prokaryotic proteins, ELIC and GLIC, form ion 
channels that are selective for cations over anions with poor 
discrimination among monovalent cations, characteristics that 
resemble the conduction properties of the cation-selective branch 
of the family that includes acetylcholine and serotonin 
receptors. Here we present the X-ray structure of GLIC at 3.1 Å 
resolution. The structure reveals a conformation of the channel that is 
distinct from ELIC and that probably resembles the open state. In 
combination, both structures suggest a novel gating mechanism 
for pentameric ligand-gated ion channels where channel opening 
proceeds by a change in the tilt of the pore-forming helices. 

GLIC was crystallized at pH 4, a condition where the open prob- 
ability of the channel is high, and its structure was determined by 
molecular replacement followed by cyclic non-crystallographic sym- 
metry averaging (Supplementary Figs 1-3, Supplementary Table 1). 
In its overall organization, the structure of GLIC closely resembles 
that of ELIC (Fig. 1): the extracellular domain encompasses 10 
β-strands that are organized as a sandwich of two tightly interacting 
β-sheets, while the transmembrane domain folds into a bundle of 
four α-helices. As for other pLGICs of prokaryotic origin, an 
extended region between helices α3 and α4 is absent and replaced 
by a short loop. The close correspondence between the ELIC and 
GLIC structures is emphasized in a superposition of their Cα 
positions (Fig. 1c). Despite the moderate sequence conservation (20% 
identical residues, Supplementary Fig. 4), both proteins are very 
similar on a structural level. Whereas the elements of secondary 
structure are highly conserved, spatial differences in the ligand- 
binding and the transmembrane domain indicate that the two 


Figure 1| GLIC structure. a, Ribbon 
representation of GLIC viewed from within the 
membrane with the extracellular solution above. 
The approximate membrane boundaries are 
indicated. b, Structure of the pentameric 
channel viewed from the extracellular side. 

c, Superposition of GLIC (green) and ELIC 
(orange, PDB 2vl0) subunits. The protein is 
shown as a Cα trace, the subunit is viewed from 
within the membrane. The superposition was 
generated by a least squares fit of the Cα positions 
of the respective α1 helices of the five subunits of 
the pentameric channel. Figures 1-4 were 
prepared with DINO (http://www.dino3d.org). 


¹Department of Biochemistry, University of Zürich, Winterthurerstrasse 190, CH-8057 Zürich, Switzerland. 




proteins adopt two distinct conformations of the ion conduction 
pore (Fig. 2, Supplementary Fig. 5). 
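
The sequence identity figures quoted for GLIC and ELIC depend on how identity is counted. The sketch below shows one common convention, assumed here because the text does not specify it: identical residues divided by the number of aligned columns in which neither sequence has a gap. The two aligned strings are hypothetical and are not the GLIC or ELIC sequences.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percentage of identical residues over columns without a gap in either sequence."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    identical = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if res_a == res_b:
            identical += 1
    if compared == 0:
        raise ValueError("no aligned columns to compare")
    return 100.0 * identical / compared

# Hypothetical aligned fragments used only to exercise the function.
fragment_a = "MKT-AYLLV"
fragment_b = "MRTQA-LIV"
identity = percent_identity(fragment_a, fragment_b)
```

Other denominators (alignment length, or the shorter sequence length) are also in use and give somewhat different percentages, which is worth keeping in mind when comparing values between studies.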

The conformational differences between the two ion channels are 
manifested in their pore geometry: whereas the transmembrane pore 
of ELIC is constricted on its extracellular side, the equivalent region 
of GLIC shows a funnel-shaped opening with a linearly decreasing 
diameter that places its narrowest part at the intracellular entry of the 
channel (Fig. 2c). At this constricted position of GLIC, the ELIC pore 
is dilated to a water-filled cavity of about 6 Å in diameter. The 
mismatch in the pore geometry results from differences in the 
orientation of the α2 and α3 helices. With respect to ELIC, both helices in 
GLIC have rotated as a rigid unit by about 9° around an axis that 
intersects with residue Val 267 and that runs parallel to the 
membrane (Fig. 2a). This rotation results in an outward movement of the 
helix pair away from the pore axis on the extracellular side and an 
inward movement towards the axis at the intracellular entry of the 
channel. In contrast to the large differences in the position of the α2 
and α3 helices, the location of helix α1 that connects the 
transmembrane pore to the extracellular domain is virtually unchanged (Fig. 1c, 
Supplementary Fig. 5). 
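
The geometric consequence of such a rigid-body tilt can be sketched with Rodrigues' rotation formula. The geometry below is a deliberately simplified toy model, not the crystal structure: the pore axis is taken as the z axis, a straight helix runs parallel to it at a radial distance of 5 Å, and the pivot and helix length are assumed values. Only the 9° angle and the orientation of the rotation axis (parallel to the membrane) come from the text.

```python
import math

def rotate_about_axis(point, pivot, axis, angle_deg):
    """Rotate a point about an axis through `pivot` (Rodrigues' formula)."""
    norm = math.sqrt(sum(c * c for c in axis))
    kx, ky, kz = (c / norm for c in axis)
    theta = math.radians(angle_deg)
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    vx, vy, vz = (p - q for p, q in zip(point, pivot))
    # Cross product k x v and dot product k . v
    cx, cy, cz = ky * vz - kz * vy, kz * vx - kx * vz, kx * vy - ky * vx
    k_dot_v = kx * vx + ky * vy + kz * vz
    rx = vx * cos_t + cx * sin_t + kx * k_dot_v * (1.0 - cos_t)
    ry = vy * cos_t + cy * sin_t + ky * k_dot_v * (1.0 - cos_t)
    rz = vz * cos_t + cz * sin_t + kz * k_dot_v * (1.0 - cos_t)
    return (rx + pivot[0], ry + pivot[1], rz + pivot[2])

def radial_distance(point):
    """Distance of a point from the pore axis (the z axis in this toy model)."""
    return math.hypot(point[0], point[1])

pivot = (5.0, 0.0, 0.0)           # assumed pivot on the helix
tilt_axis = (0.0, 1.0, 0.0)       # parallel to the membrane plane
outer_end = (5.0, 0.0, 10.0)      # extracellular end of the toy helix
inner_end = (5.0, 0.0, -10.0)     # intracellular end of the toy helix

outer_rotated = rotate_about_axis(outer_end, pivot, tilt_axis, 9.0)
inner_rotated = rotate_about_axis(inner_end, pivot, tilt_axis, 9.0)
```

In this toy geometry the extracellular end moves away from the pore axis and the intracellular end moves towards it by the same amount, reproducing the qualitative outward and inward displacements described above.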

A comparison of the pore region of the nicotinic acetylcholine 
receptor (nAChR), which was modelled into electron density at 
4 Å resolution, with the corresponding region in the two bacterial 
channels reveals interesting similarities: the extracellular part of the 
nAChR pore resembles the funnel-shaped structure of GLIC, whereas 
the diameter in the intracellular half is closer to ELIC (Supplementary 
Fig. 6). Although the nAChR structure has been proposed to 
represent a non-conducting conformation, as it was obtained in the 
absence of ligands, its relation to the two conformations of bacterial 
pentameric ligand-gated ion channels (pLGICs) at high resolution 
remains ambiguous. 

Although the shape of the transmembrane channel in GLIC differs, 
the chemical features that are common to members of this branch of 
the family are preserved (Fig. 3). As in other cation-selective pLGICs, 
the extracellular half of the pore is dominated by hydrophobic resi- 
dues, some of which carry bulky side chains. In contrast to the outer 
half, the intracellular part is lined by polar side chains of serine and 
threonine residues that provide a hydrophilic character to this 
region. Both ends of the transmembrane channel are framed by rings 
of acidic residues that are located at the membrane boundary and 
that contribute to the overall negative electrostatic environment in 
the pore (Fig. 2c). Residual electron density in the water-filled chan- 
nel indicates the presence of partially ordered solvent molecules 
(Supplementary Fig. 7). A ring of glutamate residues on the intracel- 
lular pore entry is remarkable. Equivalent residues in the nAChR, 
termed the ‘intermediate ring of charges’, have previously been 
identified as major determinants of conductance and cation selectivity. 
In GLIC, the side chains of the five glutamate residues that are well 
defined in the electron density project into the lumen of the pore and 
form a ring of interacting ionizable groups (Fig. 3a, Supplementary 
Fig. 3b). They define the narrowest part of the channel and resemble a 
narrow selectivity filter, akin to other selective ion channels¹⁰, that 
might directly interact with ions which have stripped part of their 
hydration shell. When investigating crystals that were soaked in 
solutions containing the permeant cations Rb⁺, Cs⁺ and the divalent 
cation Zn²⁺, we were able to locate electron density of bound ions 
in the region intra- and extracellular to the constriction, thus 
emphasizing its potential role for conductance and ion selectivity 
in GLIC (Fig. 3b, Supplementary Table 1). 

The tight interactions of the five glutamate side chains defining the 
pore constriction might impede the free diffusion of ions. Because in 
a cellular context the intracellular part of the open channel faces the 
physiological pH of the cytoplasm, it is possible that the protonation 
state of these residues is shifted at low pH and that the interactions 
observed in the electron density are a consequence of the crystalliza- 
tion conditions. When investigated by excised inside-out patch- 
clamp recording, the current passed by the channel decreases upon 
lowering the intracellular pH, as expected when the net charge of the 
protein becomes more positive (Supplementary Fig. 1c). To invest- 
igate the extent to which the observed pore structure might be influ- 
enced by local interactions, we have determined the structure of 
the mutant E221A at 3.5 Å. The decreased cation selectivity of 
the mutant emphasizes the critical role of this residue for ion con- 
duction (Supplementary Fig. 1d). The structure of the mutant, which 


Figure 2 | Comparison of the pore region. 

a, View of the Cα trace of the α2–α3 helices. The 
respective regions of the superimposed structures 
of GLIC (green) and ELIC (orange) are shown. 
The front subunit is removed for clarity. The view 
is from within the membrane. b, Conformation 
of the α2–α3 helix pair of a single subunit as 
observed in the structures (left) and after a least 
squares fit of the Cα positions of the respective 
helix pair of ELIC onto GLIC (right). The 
rotation axis is indicated. c, Pore radius (r) and 
electrostatic potential (Φ) in GLIC and ELIC. 
Orientation of GLIC is shown above; vertical 
dashed lines, molecular boundaries; vertical 
dotted lines, transmembrane region. Top plot, r 
of GLIC (green) and ELIC (orange). Bottom plot, 
Φ along the pore axis of GLIC (green) and ELIC 
(orange), as calculated from a numerical solution 
of the linearized Poisson—Boltzmann equation at 

Figure 3 | Pore structure and ion binding. a, View of the α2 helices of GLIC 
defining the pore region. The front subunit is removed for clarity. The 
molecular surface is shown as grey mesh. b, Intracellular part of the pore 
region. Shown are the isomorphous difference electron density of Cs⁺ (grey, 
contoured at 4.5σ) and Rb⁺ (blue, contoured at 3.5σ), and the anomalous 
difference electron density of Zn²⁺ (red, contoured at 4.5σ). The position of 
selected residues is indicated. c, Schematic representation of the pore 
opening mechanism. The α2/α3 helices of two subunits in the closed (left) 
and open (right) conformation are shown. The ion-coordinating glutamate 
residues are shown in red, the permeating ions in blue. The axis of rotation 
(black) in each subunit is indicated. 


crystallized in similar conditions as wild type, showed an unaltered 
overall conformation of the protein backbone with a pore where the 
intracellular constriction is removed (Supplementary Fig. 8). A sim- 
ilar pore structure would be obtained in a case where the glutamate 
side chains change their conformation (Supplementary Fig. 8b, c). 
Our results thus suggest that the overall conformation of the pore- 
forming helices is not perturbed by local interactions of the partly 
protonated residues at low pH, and that minor changes in the side- 
chain conformation are sufficient to increase the pore radius. The 
conical aqueous pore structure and the ability to bind ions at the 
narrow intracellular constriction thus suggest that the structure of 
GLIC either depicts a conducting state of a pLGIC or a conformation 
that is very close to a conducting state (Fig. 3c). 

Although GLIC shares the hallmarks of pLGICs, the ligand that 
triggers its channel opening is unusual. GLIC has been shown to be 
activated by a decrease in the extracellular pH (ref. 2). It is thus 
conceivable that the titration of a group in the consensus ligand- 
binding region causes conformational transitions in the extracellular 
domain that are similar to those induced in response to the binding of 
neurotransmitters in other members of the family¹¹. The 
ligand-binding site of pLGICs is located at the interface between two 
subunits, and is covered by the strands β9 and β10 (Fig. 4). When 
compared to ELIC and acetylcholine-binding protein (AChBP) this 
site shows distinctive features. The structure around β9 is only weakly 
conserved, particularly in the loop region connecting β9 and β10 
(which is seven residues shorter than in ELIC) (Fig. 4a, 
Supplementary Fig. 4). Whereas in ELIC the loop is mobile and thus 
only poorly defined in the electron density, its conformation in GLIC 
is well defined with residues being involved in a variety of interac- 
tions, some of which involve ionizable residues (Fig. 4b, 
Supplementary Fig. 3c). Several of these interactions are unique to 
GLIC, such as a salt bridge between Arg 76 and Glu 180 at the position 
where the quaternary ammonium is bound in AChBP. The tight 
interactions in the loop region of GLIC resemble the conformation 
that has been observed in AChBP in the presence of its substrate 
analogue carbamylcholine, except that the pocket for the ligand 
found in AChBP and in the respective position in ELIC is filled with 
protein side chains (Supplementary Fig. 9). The structural features of 
the ligand-binding region are thus in accordance with an activated 
ligand-occupied state of the domain. 

Conformational differences in the conserved regions of the ligand- 
binding domain hint towards transitions that lead to channel open- 
ing. When compared to ELIC, parts of the ligand-binding domain in 
GLIC have undergone considerable structural rearrangements. The 
movements are most pronounced for residues in one of the two 
β-sheets constituting the extracellular domain that was previously 
termed the ‘inner sheet’, part of which has tilted by about 5° towards 
the membrane plane (Fig. 4a). The structural differences between 
ELIC and GLIC also affect the loop regions connecting the strands 
β1/β2 and β6/β7 at the boundary between the cytoplasmic domain 
and the pore. Both regions contact the α2/α3 loop in the 
transmembrane domain and have previously been identified to play a critical 
role in channel opening. Yet, the extent of the movement in the 
extracellular domain is small compared to the large conformational 
difference in the adjacent region of the pore domain, as if the break- 
ing of a critical interaction in the interface would have allowed the 
relaxation of a strained conformation of the channel (Fig. 1c, 
Supplementary Figs 3d, 10). Within the extracellular domain, the 
GLIC structure shows the relationship between conserved residues 
that are critical for channel opening. These residues include Arg 191 
at the end of β10 that forms a pair of salt bridges with Asp 31 in the 
β1/β2 turn and Asp 121 in the β6/β7 loop (which is part of the 
Cys-loop signature sequence, Fig. 4c). Mutations that remove either of the 
two interactions in the nAChR and other pLGICs drastically decrease 
the open probability of the channel, thus emphasizing the 
importance of the respective loop regions for the transduction of 
conformational signals from the ligand-binding site to the pore domain. 

The pore structure of GLIC is in striking accordance with electro- 
physiological investigations on the conducting conformation of the 
nAChR. Different studies have previously predicted a funnel-shaped 
channel with a short constriction on the intracellular side that 
harbours a single ion-binding site at the narrowest part of the pore. 
Whereas the structure of ELIC in a non-conducting conformation 
showed a pore that was constricted on the outside and lacked binding 
sites for ions in the pore, the structure of GLIC now displays the 
predicted features. The structure also allows a potential explanation 


Figure 4 | Extracellular domain. a, View of the 
ligand-binding domains in the superimposed 
structures of GLIC (green) and ELIC (orange). 
Proteins are shown as Cα representation. 
b, Ligand-binding region of GLIC. Residues in 
the vicinity of the β9/β10 loop are shown. The 
view is from the extracellular side. Selected 
residues and their interactions are indicated. c, Cα 
representation of the interface region between the 
extracellular domain and the pore in GLIC 
(green) and ELIC (orange) viewed from the 
extracellular side. Selected residues in GLIC are 
shown as sticks, ionic interactions are indicated 
by dotted lines. 
for the dominant role of the residues constituting the ‘intermediate 
ring of charges’ in channel function. These residues, which 
correspond to the ion-coordinating glutamates in GLIC, and that are in 
the same position in the homopentameric α7 nAChR and one helix 
turn towards the cytoplasm in the muscle nAChR, have been 
identified to exert a prevailing influence on ion conduction and cation 
selectivity. The GLIC structure now suggests that the negatively 
charged side chains directly interact with the partly desolvated 
permeant cations, rather than merely contributing to the overall 
negative electrostatics in the intracellular part of the pore. The analysis of 
the electrostatic potential of GLIC shows a pronounced energy 
minimum near the intracellular entry of the open channel that is 
decreased in the closed channel and that was previously observed 
in a functional study that investigated the binding of positively charged 
MTS (methanethiosulphonate) reagents to the nAChR (Fig. 2c). 
The comparison of the two high resolution structures of pLGICs 
gives structural insight into the mechanisms underlying pore open- 
ing. These mechanisms have been ambiguous so far, and have mainly 
been derived from electron microscopy images at low resolution. 
Although previous studies have proposed that channel opening 
proceeds by a rotation of the pore-lining helices around their helix axis, 
the results of our study suggest a different mechanism. Assuming that 
the structures of ELIC and GLIC depict the closed and open 
conformations of a pLGIC pore, opening occurs by a change in the tilt of the 
helices α2 and α3 that move as a rigid body around an axis that runs 
parallel to the membrane and that intersects with a residue in α3 
about two-thirds across the transmembrane channel (Fig. 3c). This 
mechanism is in accordance with previously recorded functional 
data. Our study thus provides a first structural view at high 
resolution into how a pLGIC may open and selectively conduct ions. 


METHODS SUMMARY 


GLIC was expressed in Escherichia coli with its amino terminus fused to 
maltose-binding protein (MBP) that was preceded by a signal sequence (PelB). The 
protein was purified from isolated membranes in the detergents 
n-dodecyl-β-D-maltoside or n-tridecyl-β-D-maltoside. MBP was cleaved during purification 
by proteolytic digestion at a specific site located between MBP and the channel 
protein. Crystals were grown at pH 4.0 with addition of 9% PEG 4000, 225 mM 
(NH4)2SO4 and 0.5 mg ml⁻¹ E. coli lipids. Data at 3.1 Å resolution were collected 
at the X06-SA beamline at the Swiss Light Source (SLS) of the Paul Scherrer 
Institute (PSI) on a Pilatus detector (Dectris) (Supplementary Table 1). The 
crystals were of space group C2 with one homo-pentameric channel in the 
asymmetric unit (Supplementary Fig. 2). The structure was determined by 
molecular replacement using as search model a modified structure of ELIC, with 
all side chains truncated and helix α2 and loop regions at the interface between 
the ligand-binding domain and the channel removed. Phases were improved and 
extended by cyclic fivefold NCS averaging. The final electron density was of very 
high quality and devoid of model bias (Supplementary Fig. 3). All subunits are 
structurally very similar and were refined maintaining strong constraints 
throughout (Supplementary Table 1). The final structure that encompasses resi- 
dues 7-316 is well refined with good stereochemistry and with no outliers in 
disallowed regions of the Ramachandran plot. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 


Received 23 June; accepted 26 September 2008. 
Published online 5 November 2008. 


1. Hilf, R. J. & Dutzler, R. X-ray structure of a prokaryotic pentameric ligand-gated 
ion channel. Nature 452, 375-379 (2008). 

2. Bocquet, N. et al. A prokaryotic proton-gated ion channel from the nicotinic 
acetylcholine receptor family. Nature 445, 116-119 (2007). 

3. Karlin, A. Emerging structure of the nicotinic acetylcholine receptors. Nature Rev. 
Neurosci. 3, 102-114 (2002). 

4. Adams, D. J., Dwyer, T. M. & Hille, B. The permeability of endplate channels to 
monovalent and divalent metal cations. J. Gen. Physiol. 75, 493-510 (1980). 

5. Tasneem, A., Iyer, L. M., Jakobsson, E. & Aravind, L. Identification of the 
prokaryotic ligand-gated ion channels and their implications for the mechanisms 
and origins of animal Cys-loop ion channels. Genome Biol. 6, R4 (2004). 




NATURE | Vol 457 | 1 January 2009 


6. Miyazawa, A., Fujiyoshi, Y. & Unwin, N. Structure and gating mechanism of the 
acetylcholine receptor pore. Nature 423, 949-955 (2003). 

7. Unwin, N. Refined structure of the nicotinic acetylcholine receptor at 4 Å 
resolution. J. Mol. Biol. 346, 967-989 (2005). 

8. Imoto, K. et al. Rings of negatively charged amino acids determine the 
acetylcholine receptor channel conductance. Nature 335, 645-648 (1988). 

9. Konno, T. et al. Rings of anionic amino acids as structural determinants of ion 
selectivity in the acetylcholine receptor channel. Proc. R. Soc. Lond. B 244, 69-79 
(1991). 

10. Zhou, Y., Morais-Cabral, J. H., Kaufman, A. & MacKinnon, R. Chemistry of ion 
coordination and hydration revealed by a K⁺ channel-Fab complex at 2.0 Å 
resolution. Nature 414, 43-48 (2001). 

11. Sine, S. M. & Engel, A. G. Recent advances in Cys-loop receptor structure and 
function. Nature 440, 448-455 (2006). 

12. Brejc, K. et al. Crystal structure of an ACh-binding protein reveals the ligand- 
binding domain of nicotinic receptors. Nature 411, 269-276 (2001). 

13. Celie, P. H. et al. Nicotine and carbamylcholine binding to nicotinic acetylcholine 
receptors as studied in AChBP crystal structures. Neuron 41, 907-914 (2004). 

14. Lee, W. Y. & Sine, S. M. Principal pathway coupling agonist binding to channel 
gating in nicotinic receptors. Nature 438, 243-247 (2005). 

15. Sala, F., Mulet, J., Sala, S., Gerber, S. & Criado, M. Charged amino acids of the 
N-terminal domain are involved in coupling binding and gating in alpha7 nicotinic 
receptors. J. Biol. Chem. 280, 6642-6647 (2005). 

16. Schofield, C. M., Jenkins, A. & Harrison, N. L. A highly conserved aspartic acid 
residue in the signature disulfide loop of the alpha 1 subunit is a determinant of 
gating in the glycine receptor. J. Biol. Chem. 278, 34079-34083 (2003). 

17. Jha, A., Cadugan, D. J., Purohit, P. & Auerbach, A. Acetylcholine receptor gating at 
extracellular transmembrane domain interface: The cys-loop and M2-M3 linker. 
J. Gen. Physiol. 130, 547-558 (2007). 

18. Dani, J. A. Open channel structure and ion binding sites of the nicotinic 
acetylcholine receptor channel. J. Neurosci. 9, 884-892 (1989). 

19. Dani, J. A. & Eisenman, G. Monovalent and divalent cation permeation in 
acetylcholine receptor channels. Ion transport related to structure. J. Gen. Physiol. 
89, 959-983 (1987). 

20. Galzi, J. L. et al. Mutations in the channel domain of a neuronal nicotinic receptor 
convert ion selectivity from cationic to anionic. Nature 359, 500-505 (1992). 

21. Corringer, P. J. et al. Mutational analysis of the charge selectivity filter of the 
alpha7 nicotinic acetylcholine receptor. Neuron 22, 831-843 (1999). 

22. Gunthorpe, M. J. & Lummis, S. C. Conversion of the ion selectivity of the 5-HT(3a) 
receptor from cationic to anionic reveals a conserved feature of the ligand-gated 
ion channel superfamily. J. Biol. Chem. 276, 10977-10983 (2001). 

23. Pascual, J. M. & Karlin, A. State-dependent accessibility and electrostatic 
potential in the channel of the acetylcholine receptor. Inferences from rates of 
reaction of thiosulfonates with substituted cysteines in the M2 segment of the 
alpha subunit. J. Gen. Physiol. 111, 717-739 (1998). 

24. Wilson, G. G., Pascual, J. M., Brooijmans, N., Murray, D. & Karlin, A. The intrinsic 
electrostatic potential and the intermediate ring of charge in the acetylcholine 
receptor channel. J. Gen. Physiol. 115, 93-106 (2000). 

25. Unwin, N. Acetylcholine receptor channel imaged in the open state. Nature 373, 
37-43 (1995). 

26. Paas, Y. et al. Pore conformations and gating mechanism of a Cys-loop receptor. 
Proc. Natl Acad. Sci. USA 102, 15877-15882 (2005). 

27. Cymes, G. D., Ni, Y. & Grosman, C. Probing ion-channel pores one proton at a 
time. Nature 438, 975-980 (2005). 

28. Cymes, G. D. & Grosman, C. Pore-opening mechanism of the nicotinic 
acetylcholine receptor evinced by proton transfer. Nature Struct. Mol. Biol. 15, 
389-396 (2008). 


Supplementary Information is linked to the online version of the paper at 
www.nature.com/nature. 


Acknowledgements We thank B. Blattmann and C. Stutz-Duocommun for 
assistance with crystal screening, C. Schulze-Briese and the staff of the X06SA 
beamline for support during data collection, the protein analysis group at the 
functional genomics centre of the University of Zurich for help with mass 
spectrometry, R. MacKinnon for comments on the manuscript and members of the 
Dutzler laboratory for help in all stages of the project. Data collection was 
performed at the Swiss Light Source of the Paul Scherrer Institute. The research 
leading to these results was supported by a grant from the National Center for 
Competence in Research (NCCR) in Structural Biology and by an EC FP7 grant for 
the EDICT consortium (HEALTH-201924). R.J.C.H. is affiliated with the Molecular 
Life Sciences Ph.D. programme of the University/ETH Zurich. 


Author Contributions R.D. and R.J.C.H. designed the project. R.J.C.H. carried out all 
experiments. R.D. assisted in data collection and structure determination. R.D. and 
R.J.C.H. jointly wrote the manuscript. 


Author Information Coordinates of GLIC have been deposited in the Protein Data 
Bank under accession number 3EHZ (WT), and 3EIO (E221A). Reprints and 
permissions information is available at www.nature.com/reprints. 
Correspondence and requests for materials should be addressed to R.D. 
(dutzler@bioc.uzh.ch). 




doi:10.1038/nature07461 


METHODS 

Expression and purification. The gene encoding GLIC was cloned into a modified 
pET26b vector (Novagen) where it was fused to the carboxy terminus of MBP, 
generating a construct with the pelB signal sequence followed by a (His)10-tag, 
MBP, a human rhinovirus (HRV) 3C protease site (GE Healthcare) and the GLIC 
sequence. BL21(DE3) cells transformed with this construct were grown at 37 °C in 
TB medium containing 50 mg l⁻¹ kanamycin to an A600 of ~1.6-1.8. Expression 
was induced by addition of 0.2 mM IPTG overnight at 20 °C. All following steps 
were carried out at 4 °C. Cells were harvested and lysed with an Emulsiflex 
high-pressure homogenizer (Avestin) in 50 mM potassium phosphate (pH 8.0), 
150 mM NaCl (buffer A), with the addition of 1 mM phenylmethyl sulphonyl 
fluoride (PMSF), 20 µg ml⁻¹ DNase I, 1 mg ml⁻¹ lysozyme, 1 µg ml⁻¹ pepstatin 
and 1 µg ml⁻¹ leupeptin. The lysate was cleared by low-spin centrifugation. 
Membranes were isolated by ultracentrifugation and the proteins were extracted 
in buffer A containing 2% n-dodecyl-β-D-maltoside (DDM, Anatrace). After 
centrifugation, the protein was purified by affinity chromatography on a 
Ni-NTA column (Qiagen). The purified MBP-GLIC fusion protein was digested 
with HRV 3C protease to cleave the His10-MBP protein and dialysed against 
10 mM potassium phosphate (pH 8.0), 150 mM NaCl and either 0.5 mM DDM 
or 0.2 mM n-tridecyl-β-D-maltoside (TDM, buffer B). His10-MBP was 
subsequently removed from solution by binding to Ni-NTA resin. GLIC was 
concentrated and subjected to gel filtration on a Superdex 200 column (GE Healthcare) 
in buffer B. The protein peak corresponding to the GLIC pentamer was pooled, 
concentrated to 10 mg ml⁻¹ and used for crystallization. 

Crystallization. GLIC was crystallized in sitting drops at 4 °C. Crystals were 
obtained by mixing protein containing additional 0.5 mg ml⁻¹ E. coli polar lipids 
(Avanti Polar Lipids) in a 1:1 ratio with reservoir solution containing 225 mM 
(NH4)2SO4, 50 mM sodium acetate (pH 4.0), 9-12% PEG 4000. For the 
crystallization of the GLIC mutant E221A, the reservoir solution contained 500 mM 
(NH4)2SO4, 50 mM sodium acetate (pH 4.0), 9-12% PEG 4000. For 
cryoprotection, the crystals were transferred into mother liquor containing 30% 
ethylene glycol and flash-frozen in liquid propane. For soaking in 100 mM of 
Rb+ and Zn2+, crystals were incubated in mother liquor where NaCl was 
replaced by the respective chloride salt; for soaking in a high (250 mM) 
concentration of Cs+ ions, crystals were incubated in mother liquor where (NH4)2SO4 
and NaCl were replaced by Cs2SO4. 

Structure determination. Data sets were collected on frozen crystals on the 
X06SA beamline at the Swiss Light Source (SLS) of the Paul Scherrer Institut 
(PSI) on a PILATUS detector (Dectris). The data were indexed, integrated and 
scaled with the programs XDS, MOSFLM and SCALA and further processed 
with CCP4 programs (Supplementary Tables 1 and 2). Initial phases were 
obtained by molecular replacement with PHASER using a partial structure 
of ELIC as search model. All side chains of the ELIC pentamer were truncated 
to alanine, and helix α-2 and the loop regions in the interface between the two 
domains were removed. Phases were improved and extended to 3.1 Å by fivefold 
NCS symmetry averaging in DM. The model was built in O and initially 
refined maintaining strict fivefold NCS constraints in CNS and PHENIX. 
In later stages, the strict constraints were loosened and restrained individual 
B-factors were refined. R and Rfree were monitored throughout. Rfree was 
calculated by selecting 5% of the reflection data in thin slices that were omitted in 
refinement. The final model has R/Rfree values of 23.8% and 26.6%, good 
geometry and no outliers in the Ramachandran plot (Supplementary Table 1). The 
pore radii were calculated with the program HOLE. The molecular surface was 
calculated with MSMS. 
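
As an illustration of the R and Rfree bookkeeping described above, the following is a minimal sketch and not the procedure implemented in CNS or PHENIX. It assumes that "thin slices" means thin resolution shells, that the reflections are already sorted by resolution, and that the standard definition R = Σ||Fobs| − |Fcalc|| / Σ|Fobs| applies; the function names and the number of slices are hypothetical.

```python
def r_factor(f_obs, f_calc):
    """Crystallographic R-factor: sum| |Fobs| - |Fcalc| | / sum|Fobs|."""
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    denominator = sum(abs(o) for o in f_obs)
    return numerator / denominator


def free_flags_in_thin_slices(n_reflections, free_fraction=0.05, n_slices=20):
    """Flag test-set reflections in thin contiguous slices.

    Assumes reflections are sorted by resolution. The list is cut into
    n_slices blocks and the first free_fraction of each block is flagged,
    so the test set forms thin shells rather than scattered reflections.
    """
    if n_reflections < n_slices:
        raise ValueError("need at least one reflection per slice")
    block = n_reflections // n_slices
    n_free = round(block * free_fraction)
    return [(i % block) < n_free for i in range(n_reflections)]


def r_work_and_free(f_obs, f_calc, free_flags):
    """Return (R_work, R_free) for the working and test sets."""
    work = [(o, c) for o, c, f in zip(f_obs, f_calc, free_flags) if not f]
    free = [(o, c) for o, c, f in zip(f_obs, f_calc, free_flags) if f]
    r_work = r_factor([o for o, _ in work], [c for _, c in work])
    r_free = r_factor([o for o, _ in free], [c for _, c in free])
    return r_work, r_free
```

Selecting the test set in thin shells rather than at random is commonly done when non-crystallographic symmetry is present, because NCS-related reflections in the working set would otherwise be correlated with the test set.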

Poisson-Boltzmann calculations. Electrostatic potential was calculated by 
solving the linearized Poisson-Boltzmann equation in CHARMM on a 
110 Å × 110 Å × 150 Å grid (1 Å grid spacing) followed by focusing on a 
90 Å × 90 Å × 140 Å grid (0.5 Å grid spacing). Partial protein charges were 
derived from the Param19 extended hydrogen atom force field. Polar hydrogen 
positions were generated in CHARMM. The protein was assigned a dielectric 
constant of 2. Its transmembrane region was embedded in a 30-Å-thick slab 
(ε = 2) representing the membrane that contained a cylindrical hole around 
the water-filled pore region and was surrounded by an aqueous environment 
(ε = 80). Calculations were repeated with mobile ion concentrations of 0 and 
150 mM of monovalent salt in the solvent region excluding the transmembrane 
channel. 
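
To give a sense of scale for the two mobile-ion conditions, the sketch below estimates the Debye screening length that enters the linearized Poisson-Boltzmann equation. It is an illustrative calculation only, not part of the CHARMM protocol; the temperature of 298.15 K is an assumption (the text does not state one), while the solvent dielectric constant of 80 and the 150 mM monovalent salt concentration are taken from the paragraph above.

```python
import math

# Physical constants in SI units.
EPSILON_0 = 8.8541878128e-12  # vacuum permittivity, F/m
K_B = 1.380649e-23            # Boltzmann constant, J/K
E_CHARGE = 1.602176634e-19    # elementary charge, C
N_A = 6.02214076e23           # Avogadro constant, 1/mol


def debye_length_angstrom(conc_molar, eps_r=80.0, temp_kelvin=298.15):
    """Debye length (in Å) for a monovalent (1:1) salt solution.

    kappa^2 = 2 * N_A * e^2 * c / (eps_0 * eps_r * k_B * T),
    with c in mol/m^3. With no mobile ions there is no screening,
    so the length is infinite.
    """
    if conc_molar <= 0.0:
        return math.inf
    conc_si = conc_molar * 1000.0  # mol/l -> mol/m^3
    kappa_sq = (2.0 * N_A * E_CHARGE ** 2 * conc_si) / (
        EPSILON_0 * eps_r * K_B * temp_kelvin
    )
    return 1e10 / math.sqrt(kappa_sq)  # metres -> Å


if __name__ == "__main__":
    for conc in (0.0, 0.150):
        print(f"{conc * 1000:.0f} mM: {debye_length_angstrom(conc):.1f} Å")
```

Under these assumptions the screening length at 150 mM is a little under 8 Å, which is small compared with the grid dimensions, whereas at 0 mM the potential is unscreened.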

Electrophysiology. Constructs containing the gene of GLIC and the mutant 
E221A preceded by the signal sequence of the human α7-nAChR were cloned 
into the pTLN vector for expression in Xenopus laevis oocytes. After 
linearization of the plasmid DNA by MluI, capped complementary RNA was transcribed 
with the Message Machine kit (Ambion) and purified with the RNeasy kit 
(Qiagen). For expression, 1-50 ng of RNA was injected into defolliculated 
oocytes. Two-electrode voltage-clamp measurements were performed 2-3 days 
after injection at 20 °C (OC-725B, Warner Instrument Corp.; Supplementary 
Fig. 1). Currents were recorded in bath solutions containing 130 mM NaCl, 
2 mM KCl, 1.8 mM CaCl2 and 5 mM MgCl2. The respective pH values were 
stabilized by addition of 10 mM Hepes (pH 7.0) or 10 mM of sodium citrate 
(pH 4.0-5.5). For inhibition studies 5 mM of TBA was added. Low salt solutions 
contained 30 mM NaCl, 100 mM mannitol, 2 mM KCl, 1.8 mM CaCl2, 1 mM 
MgCl2 and 10 mM sodium citrate at pH 4.0. Inside-out patch-clamp 
measurements were recorded 3 days after injection with an Axopatch 200B amplifier 
(Axon Instruments; Supplementary Fig. 1c). Electrodes had a resistance of 
1-3 MΩ. Bath solution contained 150 mM NaCl, 1 mM CaCl2 and 5 mM 
MgCl2. The respective pH values were stabilized by addition of 10 mM Hepes 
(pH 7.0) or 10 mM of sodium citrate (pH 4.0, pH 5.0). The pipette solutions 
contained 150 mM NaCl, 1 mM EGTA, 5 mM MgCl2 and 10 mM sodium 
citrate (pH 4.0). 

At a time when financial markets are shaky, science career opportunities 
uncertain and the plight of the serial postdoc as much of a concern 
as ever, this year’s Postdoc Journal keepers offer hope. In the past 
few months two of them have earned permanent positions. One has 
embarked on the challenge of motherhood. The fourth has found a research position 
that, although immensely challenging, always enthralls her. Each sums up their 
experiences and future ambitions in online essays (see http://tinyurl.com/57hqy7). 

Jon Yearsley, a veritable postdoc-aholic of ten years’ experience, managed to live 
up to one of his New Year's resolutions from last year: to have a permanent position 
or no position at all. He recently landed a lecturing job in Dublin and was joined by 
his partner after much time apart. Of course, challenges remain: will his brand of 
interdisciplinary research attract funding? What about students? 

New responsibilities have proven daunting for Zachary Lippman, who has just 
moved to Cold Spring Harbor Laboratory in New York as an assistant professor 
after a postdoc in Israel. He knows he must generate preliminary data and write 
grant proposals, but he remains wistful about his past experiences and colleagues. 

Aliza le Roux started the year with a research trek of epic proportions. She went 
from her native South Africa to her new principal investigator's lab at Michigan 
State University before, in short order, heading to Ethiopia for her new postdoc. She 
is fascinated every day by the natural soap opera performed by a troop of baboons 
she studies there. Le Roux loves her job, she says unabashedly. She gets to play with 
animals and chase her own intellectual questions. But she worries that her work 
might make a ‘normal’ life a difficult prospect. 

Amanda Goh, on the other hand, has taken a step towards that so-called normal 
life. Goh became quite frustrated with her lab work this year; she contemplated 
starting a completely new project. Now, as she prepares for the birth of her first 
child, she already recognizes that the careful planning she strives for in the lab may 
be even more difficult to carry out in the nursery. 

Gene Russo is editor of Naturejobs. 

Gravitational astronomy 101 


It's a bit of a shock. 


David Blair 


I want to start this course with a brief review 
of the history of gravitational astronomy, 
starting from the Eta Carinae supernova of 
2009. You will be in mid-career when the 
shock front hits, and a good understanding 
of GA will be useful for anyone remotely 
involved in the Shield Project. 

The UN declared 2009 to be an Interna- 
tional Year of Astronomy, in honour of the 
400th anniversary of Galileo’s first observations 
with a telescope. Just 5 years before 
Galileo discovered craters on the Moon 
and Jupiter’s moons, Kepler had recorded 
a supernova. From that date until 2009, for 
the entire time since the invention of the tel- 
escope, there had been a complete absence 
of supernovae in our Galaxy. Astronomers 
thought this very unfair. If you had been 
born in the year 1000 you could have seen 
two supernovae in your lifetime, and the 
same if you had been born in 1550. But the 
unfairness of the 400-year absence is noth- 
ing compared with the unfairness of the Eta 
Carinae supernova. For Earth, the coming 
shock is a one-in-a-million-year event. 

But I digress. Let’s focus on the gravita- 
tional astronomy. Over the years before the 
EC supernova, there had been many years 
of painstaking development of gravita- 
tional-wave detectors. The world detector 
array that now runs autonomously was in 
its infancy. There was not even one South- 
ern Hemisphere detector. The angular 
resolution (which we will derive in the next 
lecture) was lousy. 

The EC supernova was a major embar- 
rassment to science. Even though electro- 
mag astronomers had been predicting it 
for half a century, no one noticed when it 
actually happened. The gravity signal was 
so different from any predicted waveform 
that it too was only identified months after 
the event. So the most momentous astro- 
nomical event in human history — one 
that, as you all know, threatens to put an 
end to human history — was ignored for 
days, until a few amateurs noticed the 
changes in Eta Carinae. The trouble was 
that Eta Carinae was very special and very 
rare: one star among 100 billion that was 
more than 100 times as heavy as our Sun. 
Its light was blanketed by the nebula and 
the prompt γ-ray burst lit up only a corner 
of the nebula. 

Getting an accurate distance to EC was 
the key to predicting the shock arrival, 
which as you all know is expected in 
March ’81. Like all gravity signals, the EC 
signal gave us its distance, which is 7,239 
light years. You would think that was suf- 
ficiently far away for Earth to be safe. To 
see why it’s not, you have to understand 
geons and strong-field nonlinear coupling 
in supersymmetric loop quantum gravity. 

Geons were discovered theoretically by 
John Archibald Wheeler in 1955. They 
are gravitationally bound entities of pure 
gravitational energy. Most astrophysicists 
believe that they could not actually exist, 
despite Gell-Mann’s totalitarian principle, 
which says if it is not prohibited then it is 
compulsory. Geons are not prohibited by the 
laws of physics so they must be out there. 
But Wheeler did not consider gravitino- 
neutralino coupling. This causes strong- 
field gravitational wavefronts to gravita- 
tionally condense into geon-like bubbles 
that propagate at about 0.99c. By the 
end of the course you should be able to 
derive the threshold gravitational-wave 
amplitude for the creation of gravitino- 
neutralino geons. 

The year 2009 broke the supernova 
drought but it also gave us the great flood 
that is on its way. And for physicists it 
broke the drought in our understand- 
ing of the Universe. In 2009, physics had 
been derailed by an impossibly complex 
theory they called string theory. It was 
like Ptolemy’s theory of the Solar System 
that derailed astronomy for more than a 
millennium. While physicists used string 
theory to try to solve an apparent incom- 
patibility between quantum mechan- 
ics and gravity, astronomers were 
invoking stuff they called ‘dark 
matter’ and ‘dark energy’ to 
explain galaxy rotation and 
cosmic expansion. Physics 
was in a mess. The key to 
the great synthesis was 
in the 2009 gravity-wave 
data. The breakdown of 
general relativity at the 
black-hole event horizon 
seems obvious today, but 
it took Otulu 20 years to 
figure it out. 

It is the nonlinearity of 
classical general relativity 
that causes the instabilities 
that turn shock fronts into 
cosmic bullets. This is the 
fundamental cause of the 
catastrophe of the coming 
shock and you must understand 
it in detail. At the end 
of the course I will expect 
you to be able to derive both the gravitino- 
neutralino shock-front structure as well as 
its peak intensity and arrival time from the 
gravity-wave data, using Otulu’s strong-cou- 
pling theory. You will need to understand 
how the scalar field drives the coherent 
decay of the shock when it interacts with 
stellar plasma. You will need to be familiar 
with the extragalactic data that confirmed 
the theory, and be able to show how propa- 
gating gravitino-neutralino-geon conden- 
sates explain both cosmic-expansion data 
and galaxy rotation. This will test your 
understanding of the great synthesis. 

The absence of other galactic civiliza- 
tions is probably due to their failure to 
understand gravitino-neutralino geons. I 
hope that some of you will go on to work 
for the Shield Project and make sure we do 
not follow in their footsteps. 

Now, let’s get to work. The first slide 
shows the Otulu equation ... 
David Blair is director of the Australian 
International Gravitational Research 
Centre at the University of Western 
Australia, a co-founder of the Gravity 
Discovery Centre (www.gdc.asn.au) and 
Western Australia's Scientist of the Year. 
This story was inspired by discussions at 
CECS Chile. 